N Carolina A&T State Univ, Greensboro, NC

Optimal Structural Design of Modular Composite bare base Shelters

DR Joe G Chow, WRALC/TI

Florida International Univ, Miami, FL

Re-engineer and Re-Manufacture Aircraft Structural Components Using Laser Scanning

1. INTRODUCTION


The Summer Research Program (SRP), sponsored by the Air Force Office of Scientific Research
(AFOSR),  offers  paid  opportunities  for  university  faculty,  graduate  students,  and  high  school  students 
to  conduct  research  in  U.S.  Air  Force  research  laboratories  nationwide  during  the  summer. 

Introduced  by  AFOSR  in  1978,  this  innovative  program  is  based  on  the  concept  of  teaming  academic 
researchers  with  Air  Force  scientists  in  the  same  disciplines  using  laboratory  facilities  and  equipment 
not  often  available  at  associates'  institutions. 

The  Summer  Faculty  Research  Program  (SFRP)  is  open  annually  to  approximately  150  faculty 
members  with  at  least  two  years  of  teaching  and/or  research  experience  in  accredited  U.S.  colleges, 
universities,  or  technical  institutions.  SFRP  associates  must  be  either  U.S.  citizens  or  permanent 
residents. 

The Graduate Student Research Program (GSRP) is open annually to approximately 100 graduate
students  holding  a  bachelor's  or  a  master's  degree;  GSRP  associates  must  be  U.S.  citizens  enrolled  full 
time  at  an  accredited  institution. 

The  High  School  Apprentice  Program  (HSAP)  annually  selects  about  125  high  school  students  located 
within  a  twenty  mile  commuting  distance  of  participating  Air  Force  laboratories. 

AFOSR  also  offers  its  research  associates  an  opportunity,  under  the  Summer  Research  Extension 
Program  (SREP),  to  continue  their  AFOSR-sponsored  research  at  their  home  institutions  through  the 
award  of  research  grants.  In  1994  the  maximum  amount  of  each  grant  was  increased  from  $20,000  to 
$25,000,  and  the  number  of  AFOSR-sponsored  grants  decreased  from  75  to  60.  A  separate  annual 
report  is  compiled  on  the  SREP. 

The  numbers  of  projected  summer  research  participants  in  each  of  the  three  categories  and  SREP 
“grants”  are  usually  increased  through  direct  sponsorship  by  participating  laboratories. 

AFOSR's SRP has well served its objectives of building critical links between Air Force research
laboratories  and  the  academic  community,  opening  avenues  of  communications  and  forging  new 
research  relationships  between  Air  Force  and  academic  technical  experts  in  areas  of  national  interest, 
and  strengthening  the  nation's  efforts  to  sustain  careers  in  science  and  engineering.  The  success  of  the 
SRP  can  be  gauged  from  its  growth  from  inception  (see  Table  1)  and  from  the  favorable  responses  the 
1997  participants  expressed  in  end-of-tour  SRP  evaluations  (Appendix  B). 

AFOSR  contracts  for  administration  of  the  SRP  by  civilian  contractors.  The  contract  was  first 
awarded to Research & Development Laboratories (RDL) in September 1990. After completion of the
1990 contract, RDL (in 1993) won the recompetition for the basic year and four 1-year options.


2.  PARTICIPATION  IN  THE  SUMMER  RESEARCH  PROGRAM 

The  SRP  began  with  faculty  associates  in  1979;  graduate  students  were  added  in  1982  and  high  school 
students in 1986. The following table shows the number of associates in the program each year.


SRP Participation, by Year

YEAR    SFRP    GSRP    HSAP    TOTAL
1979      70       -       -       70
1980      87       -       -       87
1981      87       -       -       87
1982      91      17       -      108
1983     101      53       -      154
1984     152      84       -      236
1985     154      92       -      246
1986     158     100      42      300
1987     159     101      73      333
1988     153     107     101      361
1989     168     102     103      373
1990     165     121     132      418
1991     170     142     132      444
1992     185     121     159      464
1993     187     117     136      440
1994     192     117     133      442
1995     190     115     137      442
1996     188     109     138      435
1997     148      98     140      427

Beginning in 1993, due to budget cuts, some of the laboratories weren't able to afford to fund as many
associates as in previous years. Since then, the number of funded positions has remained fairly
constant at a slightly lower level.


3.  RECRUITING  AND  SELECTION 

The SRP is conducted on a nationally advertised and competitive-selection basis. Advertising for
faculty and graduate students consisted primarily of the mailing of 8,000 52-page SRP brochures to
chairpersons of departments relevant to AFOSR research and to administrators of grants in accredited
universities, colleges, and technical institutions. Historically Black Colleges and Universities
(HBCUs) and Minority Institutions (MIs) were included. Brochures also went to all participating
USAF laboratories, the previous year's participants, and numerous individual requesters (over 1000
annually).

RDL placed advertisements in the following publications: Black Issues in Higher Education, Winds of
Change, and IEEE Spectrum. Because no participants have listed either Physics Today or Chemical
Engineering News as their source of learning about the program for the past several years,
advertisements in these magazines were dropped, and the funds were used to cover increases in
brochure printing costs.

High  school  applicants  can  participate  only  in  laboratories  located  no  more  than  20  miles  from  their 
residence.  Tailored  brochures  on  the  HSAP  were  sent  to  the  head  counselors  of  180  high  schools  in 
the vicinity of participating laboratories, with instructions for publicizing the program in their schools.
High school students selected to serve at Wright Laboratory's Armament Directorate (Eglin Air Force
Base,  Florida)  serve  eleven  weeks  as  opposed  to  the  eight  weeks  normally  worked  by  high  school 
students  at  all  other  participating  laboratories. 

Each  SFRP  or  GSRP  applicant  is  given  a  first,  second,  and  third  choice  of  laboratory.  High  school 
students  who  have  more  than  one  laboratory  or  directorate  near  their  homes  are  also  given  first, 
second,  and  third  choices. 


Laboratories  make  their  selections  and  prioritize  their  nominees.  AFOSR  then  determines  the  number 
to be funded at each laboratory and approves the laboratories' selections.


Subsequently, laboratories use their own funds to sponsor additional candidates. Some selectees do
not accept the appointment, so alternate candidates are chosen. This multi-step selection procedure
results in some candidates being notified of their acceptance after scheduled deadlines. The total
applicants and participants for 1997 are shown in this table.

1997 Applicants and Participants

PARTICIPANT     TOTAL                       DECLINING
CATEGORY        APPLICANTS    SELECTEES     SELECTEES
SFRP               490           188           32
  (HBCU/MI)        (0)           (0)           (0)
GSRP               202            98            9
  (HBCU/MI)        (0)           (0)           (0)
HSAP               433           140           14
TOTAL             1125           426           55


4.  SITE  VISITS 

During June and July of 1997, representatives of both AFOSR/NI and RDL visited each participating
laboratory  to  provide  briefings,  answer  questions,  and  resolve  problems  for  both  laboratory  personnel 
and  participants.  The  objective  was  to  ensure  that  the  SRP  would  be  as  constructive  as  possible  for  all 
participants.  Both  SRP  participants  and  RDL  representatives  found  these  visits  beneficial.  At  many  of 
the  laboratories,  this  was  the  only  opportunity  for  all  participants  to  meet  at  one  time  to  share  their 
experiences  and  exchange  ideas. 


5.  HISTORICALLY  BLACK  COLLEGES  AND  UNIVERSITIES  AND  MINORITY 
INSTITUTIONS  (HBCU/MIs) 

Before  1993,  an  RDL  program  representative  visited  from  seven  to  ten  different  HBCU/MIs  annually 
to  promote  interest  in  the  SRP  among  the  faculty  and  graduate  students.  These  efforts  were  marginally 
effective, yielding a doubling of HBCU/MI applicants. In an effort to achieve AFOSR's goal of 10%
of  all  applicants  and  selectees  being  HBCU/MI  qualified,  the  RDL  team  decided  to  try  other  avenues 
of  approach  to  increase  the  number  of  qualified  applicants.  Through  the  combined  efforts  of  the 
AFOSR  Program  Office  at  Bolling  AFB  and  RDL,  two  very  active  minority  groups  were  found, 
HACU  (Hispanic  American  Colleges  and  Universities)  and  AISES  (American  Indian  Science  and 
Engineering  Society).  RDL  is  in  communication  with  representatives  of  each  of  these  organizations  on 
a monthly basis to keep up with their activities and special events. Both organizations have
widely-distributed  magazines/quarterlies  in  which  RDL  placed  ads. 

Since  1994  the  number  of  both  SFRP  and  GSRP  HBCU/MI  applicants  and  participants  has  increased 
ten-fold,  from  about  two  dozen  SFRP  applicants  and  a  half  dozen  selectees  to  over  100  applicants  and 
two  dozen  selectees,  and  a  half-dozen  GSRP  applicants  and  two  or  three  selectees  to  18  applicants  and 
7  or  8  selectees.  Since  1993,  the  SFRP  had  a  two-fold  applicant  increase  and  a  two-fold  selectee 
increase.  Since  1993,  the  GSRP  had  a  three-fold  applicant  increase  and  a  three  to  four-fold  increase  in 
selectees.

In addition to RDL's special recruiting efforts, AFOSR attempts each year to obtain additional funding
or use leftover funding from cancellations the past year to fund HBCU/MI associates. This year, 5
HBCU/MI SFRPs declined after they were selected (and no qualified replacements were available).
The following table records HBCU/MI participation in this program.


SRP HBCU/MI Participation, By Year

                 SFRP                        GSRP
YEAR    Applicants   Participants   Applicants   Participants
1985        76           23             15           11
1986        70           18             20       [illegible]
1987        82           32             32           11
1988        53           17             23           14
1989        39           15             13            4
1990        43           14             17            3
1991        42           13              8            5
1992        70           13              9            5
1993        60           13              6            2
1994        90           16             11            6
1995        90           21             20            8
1996       119           27             18            7

6.  SRP  FUNDING  SOURCES 

Funding sources for the 1997 SRP were the AFOSR-provided slots for the basic contract and
laboratory funds. Funding sources by category for the 1997 SRP selected participants are shown here.

1997 SRP FUNDING CATEGORY                         SFRP    GSRP    HSAP
AFOSR Basic Allocation Funds                       141      89     123
USAF Laboratory Funds                               48       9      17
HBCU/MI By AFOSR (Using Procured Addn'l Funds)       0       0     N/A
TOTAL                                              189      98     140

SFRP - 188 were selected, but thirty-two canceled too late to be replaced.
GSRP - 98 were selected, but nine canceled too late to be replaced.
HSAP - 140 were selected, but fourteen canceled too late to be replaced.


7.  COMPENSATION  FOR  PARTICIPANTS 


Compensation  for  SRP  participants,  per  five-day  work  week,  is  shown  in  this  table. 


1997 SRP Associate Compensation

PARTICIPANT CATEGORY                     1991   1992   1993   1994   1995   1996   1997
Faculty Members                          $690   $718   $740   $740   $740   $770   $770
Graduate Student (Master's Degree)       $425   $442   $455   $455   $455   $470   $470
High School Student (Subsequent Years)   $240   $240   $240   $240   $240   $240   $240

Graduate Student (Bachelor's Degree): $365, $391, $391, $400, $400 [two yearly entries illegible]
High School Student (First Year): $200, $200, $200, $200, $200 [two yearly entries illegible]

The program also offered associates whose homes were more than 50 miles from the laboratory an
expense allowance (seven days per week) of $50/day for faculty and $40/day for graduate students.
Transportation to the laboratory at the beginning of their tour and back to their home destinations at
the end was also reimbursed for these participants. Of the combined SFRP and GSRP associates,
65% (194 out of 286) claimed travel reimbursements at an average round-trip cost of $776.


Faculty members were encouraged to visit their laboratories before their summer tour began. All costs
of these orientation visits were reimbursed. Forty-three percent (85 out of 188) of faculty associates
took orientation trips at an average cost of $388. By contrast, in 1993, 58% of SFRP associates took
orientation visits at an average cost of $685; that was the highest percentage of associates opting to
take an orientation trip since RDL has administered the SRP, and the highest average cost of an
orientation trip. These 1993 numbers are included to show the fluctuation which can occur in these
numbers for planning purposes.

Program participants submitted biweekly vouchers countersigned by their laboratory research focal
point,  and  RDL  issued  paychecks  so  as  to  arrive  in  associates'  hands  two  weeks  later. 

This is the second year of using direct deposit for the SFRP and GSRP associates. The process of
obtaining required information from the associates went much more smoothly: only 1% of the
associates' information needed clarification for direct deposit to function properly, as opposed to
10% last year. The remaining associates received their stipend and expense payments via checks
sent in the US mail.


HSAP  program  participants  were  considered  actual  RDL  employees,  and  their  respective  state  and 
federal income tax and Social Security were withheld from their paychecks. By the nature of their
independent  research,  SFRP  and  GSRP  program  participants  were  considered  to  be  consultants  or 
independent  contractors.  As  such,  SFRP  and  GSRP  associates  were  responsible  for  their  own  income 
taxes,  Social  Security,  and  insurance. 

8.  CONTENTS  OF  THE  1997  REPORT 

The  complete  set  of  reports  for  the  1997  SRP  includes  this  program  management  report  (Volume  1) 
augmented  by  fifteen  volumes  of  final  research  reports  by  the  1997  associates,  as  indicated  below: 


1997 SRP Final Report Volume Assignments

LABORATORY           SFRP      GSRP    HSAP
Armstrong            2         7       12
Phillips             3         8       13
Rome                 4         9       14
Wright               5A, 5B    10      15
AEDC, ALCs, WHMC     6         11      16


APPENDIX  A  -  PROGRAM  STATISTICAL  SUMMARY 


A.  Colleges/Universities  Represented 

Selected SFRP associates represented 169 different colleges, universities, and institutions.
GSRP associates represented 95 different colleges, universities, and institutions.


B. States Represented

SFRP - Applicants came from 47 states plus Washington D.C. Selectees represent 44 states.
GSRP - Applicants came from 44 states. Selectees represent 32 states.
HSAP - Applicants came from thirteen states. Selectees represent nine states.


Total Number of Participants

SFRP     189
GSRP      97
HSAP     140
TOTAL    426

Degrees Represented

              SFRP    GSRP    TOTAL
Doctoral       184       0      184
Master's         2      41       43
Bachelor's       0      56       56
TOTAL          186      97      298


APPENDIX B - SRP EVALUATION RESPONSES


1.  OVERVIEW 

Evaluations  were  completed  and  returned  to  RDL  by  four  groups  at  the  completion  of  the  SRP.  The 
number  of  respondents  in  each  group  is  shown  below. 


Table B-1. Total SRP Evaluations Received

Evaluation Group                  Responses
SFRP & GSRPs                         275
HSAPs                                113
USAF Laboratory Focal Points          84
USAF Laboratory HSAP Mentors           6

All  groups  indicate  unanimous  enthusiasm  for  the  SRP  experience. 

The  summarized  recommendations  for  program  improvement  from  both  associates  and  laboratory 
personnel  are  listed  below: 


A. Better preparation on the labs' part prior to associates' arrival (i.e., office space,
computer assets, clearly defined scope of work).

B. Faculty Associates suggest higher stipends for SFRP associates.

C. Both HSAP Air Force laboratory mentors and associates would like the summer tour
extended from the current 8 weeks to either 10 or 11 weeks; the groups state it takes 4-6
weeks just to get high school students up-to-speed on what's going on at the laboratory.
(Note: this same argument was used to raise the faculty and graduate student
participation time a few years ago.)

2. 1997 USAF LABORATORY FOCAL POINT (LFP) EVALUATION RESPONSES


The summarized results listed below are from the 84 LFP evaluations received.

1. LFP evaluations received and associate preferences:


Table B-2. Air Force LFP Evaluation Responses (By Type)

How Many Associates Would You Prefer To Get? (% Response)

                        SFRP              GSRP (w/Univ Professor)    GSRP (w/o Univ Professor)
Lab     Evals      0    1    2   3+        0    1    2   3+            0    1    2   3+
        Recv'd
AEDC       0       -    -    -    -        -    -    -    -            -    -    -    -
WHMC       0       -    -    -    -        -    -    -    -            -    -    -    -
AL         7      28   28   28   14       54   14   28    0           86    0   14    0
USAFA      1       0  100    0    0      100    0    0    0            0  100    0    0
PL        25      40   40   16    4       88   12    0    0           84   12    4    0
RL         5      60   40    0    0       80   10    0    0          100    0    0    0
WL        46      30   43   20    6       78   17    4    0           93    4    0
Total     84     32%  50%  13%   5%      80%  11%   6%   0%          73%  23%   4%   0%

LFP Evaluation Summary. The summarized responses, by laboratory, are listed on the following
page. LFPs were asked to rate the following questions on a scale from 1 (below average) to 5 (above
average).

2. LFPs involved in SRP associate application evaluation process:

a. Time available for evaluation of applications:

b. Adequacy of applications for selection process:

3. Value of orientation trips:

4. Length of research tour:

5. a. Benefits of associate's work to laboratory:

b. Benefits of associate's work to Air Force:

6. a. Enhancement of research qualifications for LFP and staff:

b. Enhancement of research qualifications for SFRP associate:

c. Enhancement of research qualifications for GSRP associate:

7. a. Enhancement of knowledge for LFP and staff:

b. Enhancement of knowledge for SFRP associate:

c. Enhancement of knowledge for GSRP associate:

8. Value of Air Force and university links:

9. Potential for future collaboration:

10. a. Your working relationship with SFRP:

b. Your working relationship with GSRP:

11. Expenditure of your time worthwhile:

12. Quality of program literature for associate:

13. a. Quality of RDL's communications with you:

b. Quality of RDL's communications with associates:

14. Overall assessment of SRP:


Table B-3. Laboratory Focal Point Responses to above questions

                   AEDC    AL      USAFA   PL      RL      WHMC    WL
# Evals Recv'd             7       1       14      5       0       46
Question #
2                  -       86%     0%      88%     80%     -       85%
2a                 -       4.3     n/a     3.8     4.0     -

3. 1997 SFRP & GSRP EVALUATION RESPONSES


The summarized results listed below are from the 257 SFRP/GSRP evaluations received.

Associates were asked to rate the following questions on a scale from 1 (below average) to 5 (above
average). Results by Air Force base and overall results of the 1997 evaluations are listed after the
questions.

1. The match between the laboratory's research and your field:

2.  Your  working  relationship  with  your  LFP: 

3.  Enhancement  of  your  academic  qualifications: 

4.  Enhancement  of  your  research  qualifications: 

5.  Lab  readiness  for  you:  LFP,  task,  plan: 

6.  Lab  readiness  for  you:  equipment,  supplies,  facilities: 

7.  Lab  resources: 

8.  Lab  research  and  administrative  support: 

9.  Adequacy  of  brochure  and  associate  handbook: 

10.  RDL  communications  with  you: 

11.  Overall  payment  procedures: 

12.  Overall  assessment  of  the  SRP: 

13.  a.  Would  you  apply  again? 

b.  Will  you  continue  this  or  related  research? 

14.  Was  length  of  your  tour  satisfactory? 

15.  Percentage  of  associates  who  experienced  difficulties  in  finding  housing. 

16.  Where  did  you  stay  during  your  SRP  tour? 

a.  At  Home: 

b.  With  Friend: 

c.  On  Local  Economy: 

d.  Base  Quarters: 

17.  Value  of  orientation  visit: 

a.  Essential: 

b.  Convenient: 

c.  Not  Worth  Cost: 

d.  Not  Used: 

SFRP and GSRP associates' responses are listed in tabular format on the following page.


Table B-4. 1997 SFRP & GSRP Associate Responses to SRP Evaluation

4. 1997 USAF LABORATORY HSAP MENTOR EVALUATION RESPONSES

Not enough evaluations were received (5 total) from mentors to do a useful summary.

5. 1997 HSAP EVALUATION RESPONSES


The summarized results listed below are from the 113 HSAP evaluations received.

HSAP apprentices were asked to rate the following questions on a scale from
1 (below average) to 5 (above average).


1. Your influence on selection of topic/type of work.

2. Working relationship with mentor, other lab scientists.

3. Enhancement of your academic qualifications.

4. Technically challenging work.

5. Lab readiness for you: mentor, task, work plan, equipment.

6. Influence on your career.

7. Increased interest in math/science.

8. Lab research & administrative support.

9. Adequacy of RDL's Apprentice Handbook and administrative materials.

10. Responsiveness of RDL communications.

11. Overall payment procedures.

12. Overall assessment of SRP value to you.

13. Would you apply again next year? Yes (92%)

14. Will you pursue future studies related to this research? Yes (6?%)

15. Was tour length satisfactory? Yes (82%)

Laboratory Studies of Ionospheric Plasma Effects
Produced  by  Lightning-induced  Whistler  Waves 


Min-Chang  Lee 

Plasma  Science  and  Fusion  Center 
Massachusetts  Institute  of  Technology 
Cambridge,  Massachusetts  02139 

Abstract 

Laboratory experiments have been conducted at MIT, using the student-built Versatile
Toroidal Facility (VTF), to investigate the ionospheric plasma effects produced by
lightning-induced whistler waves. Lower hybrid waves, generated by the lightning-induced
whistler waves, can cause prominent plasma effects, such as the acceleration of electrons
and ions and the spectral broadening of plasma waves. Two mechanisms by which whistler
waves can generate lower hybrid waves are examined in the VTF experiments. One is the
simultaneous excitation of lower hybrid waves and low-frequency mode waves by intense
whistler waves (Lee and Kuo, 1984). The other is the nonlinear mode conversion of whistler
waves into lower hybrid waves in the presence of short-scale field-aligned density striations
(Groves et al., 1988). The effect of lower hybrid waves on the spectra of Langmuir waves
is singled out for investigation. The results of laboratory experiments show that lower
hybrid waves can beat with Langmuir waves to produce frequency-upshifted and
frequency-downshifted Langmuir waves (Lee et al., 1997), broadening the spectra of Langmuir
waves. The intensity of these beat waves, however, depends upon the angle of wave propagation
with respect to the magnetic field.

1.  Introduction 

Lightning can generate whistler waves with a broad spectrum of frequencies but with
dominant energy in the range of 1.6 - 10 kHz with a peak at 5 kHz (Helliwell, 1965).
The lightning-induced whistler waves, having intensities of a few tens of mV/m, have been
measured in the ionosphere by rocket-borne instruments (Kelley et al., 1985; M.C. Kelley,
personal communication, 1985). It is shown in Lee and Kuo (1984) that whistler waves with
intensities of tens of mV/m can excite lower hybrid waves and field-aligned zero-frequency
(the so-called purely growing) mode waves simultaneously via a parametric instability. This
involves a four-wave interaction process which requires a threshold intensity of whistler
waves. Only the VLF (very-low-frequency) band (3-30 kHz) of the lightning-induced
whistler waves may be intense enough to excite lower hybrid waves directly.

However,  weak  whistler  waves  can  produce  weak  lower  hybrid  waves  in  the  presence 
of  short-scale  (i.e.,  a  few  meters  to  tens  of  meters)  field-aligned  ionospheric  density 
irregularities  (Groves  et  al.,  1988).  This  process,  by  contrast,  does  not  involve  a  plasma 
instability  but  a  nonlinear  scattering  process.  The  “pre-existing”  ionospheric  density 
striations  can  effectively  scatter  whistler  waves  into  lower  hybrid  waves  via  a  nonlinear 
mode  conversion  process.  The  short-scale  ionospheric  density  striations  can  appear  in  the 
natural  spread  F  process  (e.g.,  Woodman  and  LaHoz,  1976)  or  in  the  ionospheric  heating 
by  HF  waves  (e.g.,  see  the  1974  November  issue  of  Radio  Science).  Because  the  nonlinear 
scattering  process  does  not  require  a  threshold  field  of  whistler  waves,  lower  hybrid  waves 
with  a  broad  spectrum  of  frequencies  can  be  produced  during  lightning  in  the  ionosphere 
structured  with  short-scale  density  striations. 

The appearance of lower hybrid waves in lightning can yield prominent ionospheric
plasma effects, such as the acceleration of electrons (ions) along (across) the geomagnetic
field and the frequency broadening of plasma waves (Kuo and Lee, 1992; Lee et al.,
1997). The energization of electrons and ions, caused by lower hybrid waves, can alter the
plasma distribution functions. The density fluctuations associated with lower hybrid waves
can nonlinearly scatter high-frequency plasma waves, e.g., Langmuir waves (produced by
photo-electrons or HF waves), into frequency-upshifted (so-called anti-Stokes) Langmuir
waves and frequency-downshifted (so-called Stokes) Langmuir waves (Kuo and Lee, 1992;
Lee et al., 1997).
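
The scattering just described can be summarized by the usual frequency and wave-vector matching conditions for a beat process. The following is only a sketch in notation introduced here for illustration (it is not taken from the cited papers): $\omega_0$ and $\mathbf{k}_0$ denote the original Langmuir wave, $\omega_{lh}$ and $\mathbf{k}_{lh}$ the lower hybrid wave, and the subscripts $+$ and $-$ the anti-Stokes and Stokes sidebands.

```latex
\begin{align}
\omega_{+} &= \omega_{0} + \omega_{lh}, &
\mathbf{k}_{+} &= \mathbf{k}_{0} + \mathbf{k}_{lh}
  && \text{(anti-Stokes, frequency-upshifted)} \\
\omega_{-} &= \omega_{0} - \omega_{lh}, &
\mathbf{k}_{-} &= \mathbf{k}_{0} - \mathbf{k}_{lh}
  && \text{(Stokes, frequency-downshifted)}
\end{align}
```

Under these assumptions the two sidebands sit symmetrically about the original Langmuir line, separated from it by the lower hybrid frequency, which is why their presence broadens the observed Langmuir spectrum.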

Presented in this paper are our laboratory studies of the aforementioned ionospheric
plasma effects produced by lightning-induced whistler waves. The work reported in Section
2 includes the generation of lower hybrid waves by whistler waves and the frequency
broadening of Langmuir waves caused by the lower hybrid waves. Discussions and
conclusions are given in Section 3.

2.  Laboratory  Experiments 

(A)  The  Versatile  Toroidal  Facility  (VTF) 

Our  laboratory  experiments  were  carried  out  at  MIT’s  Plasma  Science  and  Fusion 
Center,  using  the  student-built  Versatile  Toroidal  Facility  (VTF)  which  has  a  major  radius 
of  about  1  meter,  a  minor  radius  of  0.4  meter,  and  a  height  of  1.1  meters  (R.F.  Duraski, 
M.S.  Thesis,  MIT,  1991).  Presented  in  Figures  1  and  2  are  the  characteristic  features  of 
the  VTF. 

As illustrated in Figure 1, the VTF has a helical magnetic field, typically 0.07 tesla (C.
Yoo, M.S. Thesis, MIT, 1991), that guides electrons emitted from heated LaB6 filaments
to flow from the bottom side to the topside of the plasma chamber. These upward flowing
electrons serve two purposes: (1) creating a background plasma through electron collisions
with neutral gases (e.g., H2, Ar, O2), and (2) forming a field-aligned electric current.
Displayed in Figure 2 are the measured background plasma density and the toroidal magnetic
field (approximately equal to the helical magnetic field) as a function of the major radius of
the VTF plasma chamber. The peak density can reach $6 \times 10^{17}$ m$^{-3}$ in a hydrogen
plasma, and a field-aligned current of 100 amperes can be consistently achieved.

Figure 1. Schematic illustration of the Versatile Toroidal Facility (VTF). The electron
beam system is a heated filament coated with LaB6 to emit electrons.


Figure 2. The toroidal magnetic field and the density profile of electron beam-produced
plasmas versus the radius of the VTF.

The VTF plasma machine provides a plasma environment that can reasonably
simulate the ionospheric F region: $\omega_{pe}/\omega_{ce} > 3$, and $T_e \approx T_i \approx 5$ eV,
where $\omega_{pe}$, $\omega_{ce}$, $T_e$, and $T_i$ represent the electron plasma frequency, the
electron gyrofrequency, the electron temperature, and the ion temperature in the VTF,
respectively. As shown in Figure 1, a microwave at 2.45 GHz with either ordinary or
extraordinary polarization is injected into the plasma chamber from a magnetron to study
radio wave interactions with ionospheric plasmas.
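
As a rough consistency check on the quoted ratio of electron plasma frequency to electron gyrofrequency, the short Python sketch below evaluates both frequencies from their textbook definitions. It assumes that the peak hydrogen density (6 x 10^17 m^-3) and the typical field (0.07 tesla) quoted earlier apply at the same location, which the text does not state; the function and variable names are illustrative only.

```python
import math

# Physical constants (SI units).
E_CHARGE = 1.602176634e-19    # elementary charge, C
E_MASS = 9.1093837015e-31     # electron mass, kg
EPSILON_0 = 8.8541878128e-12  # vacuum permittivity, F/m


def electron_plasma_frequency_hz(density_m3: float) -> float:
    """f_pe = (1 / 2 pi) * sqrt(n e^2 / (eps0 m_e))."""
    omega_pe = math.sqrt(density_m3 * E_CHARGE**2 / (EPSILON_0 * E_MASS))
    return omega_pe / (2.0 * math.pi)


def electron_gyrofrequency_hz(b_tesla: float) -> float:
    """f_ce = (1 / 2 pi) * e B / m_e."""
    return E_CHARGE * b_tesla / (2.0 * math.pi * E_MASS)


# Values quoted in the text; combining them in one calculation is an
# assumption of this sketch, not a statement from the paper.
peak_density = 6e17  # m^-3, peak density in a hydrogen plasma
field = 0.07         # tesla, typical helical magnetic field

f_pe = electron_plasma_frequency_hz(peak_density)
f_ce = electron_gyrofrequency_hz(field)
ratio = f_pe / f_ce

print(f"f_pe = {f_pe / 1e9:.2f} GHz")
print(f"f_ce = {f_ce / 1e9:.2f} GHz")
print(f"f_pe / f_ce = {ratio:.2f}")
```

With these inputs the ratio comes out at roughly 3.5, in line with the stated condition that it exceeds 3. Lower densities away from the peak would reduce the ratio.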

(B)  Generation  of  Whistler  Waves  and  Lower  Hybrid  Waves 

Whistler waves are produced in the VTF by the injected electron beams moving
along the helical magnetic field. As displayed in Figure 3, much more intense whistler
waves are generated by a weak electric current (1 ampere). It is seen that low-frequency
(< 10 MHz) mode waves are excited under a stronger electric current (120 amperes) together
with the generation of less intense whistler waves. By contrast, low-frequency mode waves
are absent under a weak electric current, when intense whistler waves are generated. The
whole spectrum of whistler waves in the VTF, partially shown in Figure 3, is quite flat
with a sharp cut-off slightly exceeding half of the electron gyrofrequency, consistent
with the spectrum for bremsstrahlung (see, e.g., Panofsky and Phillips, 1962).

During  the  operation  of  the  electron  beam  system  for  the  period  of  3  seconds,  the 
electric  current  reaches  the  steady-state  condition  in  the  VTF.  A  stronger  electric  current 
indicates  that  a  higher  plasma  density  is  produced  in  the  VTF  by  the  energetic  electron 
beam.  As  shown  in  Figure  2,  the  electron-beam-produced  plasma  has  a  sharp  density 
gradient  in  the  radial  direction  of  the  VTF  plasma  machine.  An  intense  electric  current, 
flowing  along  the  helical  magnetic  field,  together  with  a  sharp  radial  density  gradient 
forms a source of free energy, driving the so-called current convective instability (see Ossakow and Chaturvedi, 1979, and references therein). This instability is responsible for the excitation of low-frequency mode waves, as seen in Figure 3, with a growth time shorter than the confinement time (about a fraction of a millisecond) of the VTF by one order of magnitude.

Figure 3. The low-frequency spectrum of whistler waves generated by the injected electron beams under the conditions of a high electric current (120 amperes) and a low electric current (1 ampere), respectively.

What cannot be seen in Figure 3 is the excitation of lower hybrid waves. The lower hybrid resonance frequency, $f_{lhr} = f_{pi}/(1 + f_{pe}^2/f_{ce}^2)^{1/2}$, calculated to be 30 MHz, is marked on the horizontal axis in Figure 3, where $f_{pe}$, $f_{pi}$, and $f_{ce}$ are the electron plasma frequency, the ion plasma frequency, and the electron gyrofrequency, respectively. The two mechanisms whereby whistler waves generate lower hybrid waves, as discussed in Lee and Kuo (1984) and Groves et al. (1988), respectively, are examined in the VTF experiments. The lower hybrid waves and whistler waves are measured by a Langmuir probe and a loop probe, respectively. The electric current-excited low-frequency mode waves, measured by a dipole probe, are found to be highly field-aligned. If $E_0$, $\phi_1$, and $\phi_2$ denote the wave field intensity of whistler waves, the electric potential of low-frequency mode waves, and the electric potential of lower hybrid waves, respectively, the results of the VTF experiments are given in Figures 4(a), 4(b), and 4(c). Presented for discussion in Figure 4(a) is $\phi_2$ versus the product of $E_0$ and $\phi_1$ for lower hybrid waves with frequencies near the lower hybrid resonance frequency. This process involves whistler waves with frequencies higher than those of lower hybrid waves by a few kHz, which are approximately equal to the frequencies of low-frequency mode waves.
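The lower hybrid resonance frequency quoted above can be evaluated numerically. The following is a minimal sketch, assuming a hydrogen plasma with equal electron and ion densities, so that $f_{pi} = f_{pe}(m_e/m_i)^{1/2}$. The function name and the sample input frequencies are hypothetical illustrations and are not measured VTF values.

```python
import math

# Electron-to-proton mass ratio (hydrogen plasma assumed).
ELECTRON_TO_PROTON_MASS_RATIO = 1.0 / 1836.15


def lower_hybrid_frequency(f_pe, f_ce, mass_ratio=ELECTRON_TO_PROTON_MASS_RATIO):
    """Return f_lhr = f_pi / sqrt(1 + f_pe^2 / f_ce^2), all frequencies in Hz.

    f_pi is derived from f_pe assuming singly charged ions and equal
    electron and ion densities: f_pi = f_pe * sqrt(m_e / m_i).
    """
    f_pi = f_pe * math.sqrt(mass_ratio)
    return f_pi / math.sqrt(1.0 + (f_pe / f_ce) ** 2)


# Hypothetical inputs chosen only so that f_pe / f_ce = 3.
f_lhr = lower_hybrid_frequency(f_pe=3.9e9, f_ce=1.3e9)
print(f"f_lhr = {f_lhr / 1e6:.1f} MHz")
```

With these illustrative inputs the result is about 29 MHz, the same order as the 30 MHz quoted in the text. In the dense-plasma limit ($f_{pe} \gg f_{ce}$) the expression reduces to $f_{ce}(m_e/m_i)^{1/2}$, the geometric mean of the electron and ion gyrofrequencies.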

Under the condition of high electric currents, low-frequency mode waves ($\phi_1$) can be favorably excited by the current convective instability (Ossakow and Chaturvedi, 1979). Meanwhile, weak whistler waves ($E_0$) and lower hybrid waves ($\phi_2$) appear simultaneously. It is found that $\phi_2$ increases with the product ($E_0\phi_1$) of $E_0$ and $\phi_1$, and nearly $\phi_2 \propto E_0\phi_1$ (see Regime B of Figure 4(a)), in good agreement with Equation (9) of Groves et al. (1988). The VTF results support Groves et al.'s mechanism, which generates weak lower hybrid waves ($\phi_2$) via the nonlinear scattering of whistler waves ($E_0$) off the pre-existing low-frequency field-aligned mode waves ($\phi_1$). Two things should be noted. One is that, while the nonlinear scattering process discussed in Groves et al. (1988) is based on waves with single wave numbers ($k_0$, $k_1$, $k_2$), the measured waves in the VTF experiments have a finite bandwidth of wave numbers. The other is that the measured low-frequency mode waves are not zero-frequency mode waves. Rather, they have small Doppler frequencies due to moving electrons in the electric current.

Under low electric currents (typically, a few amperes), the current convective instability cannot be excited and thus low-frequency mode waves as intense as those shown in Figure 4(a) are absent. However, intense whistler waves ($E_0$) and lower hybrid waves ($\phi_2$) as well as relatively weak low-frequency mode waves ($\phi_1$) appear concurrently. The measured wave field intensity ($E_0$) of whistler waves versus the electric potential ($\phi_2$) of lower hybrid waves, and the wave field intensity ($E_0$) of whistler waves versus the electric potential ($\phi_1$) of low-frequency mode waves are plotted in Figures 4(b) and 4(c), respectively. It is seen that the intensities of both the lower hybrid waves ($\phi_2$) and low-frequency mode waves ($\phi_1$) increase with that of whistler waves ($E_0$) at decreasing rates. In other words, when the intensity of whistler waves is high, those of low-frequency mode waves and the lower hybrid waves become saturated. This trend can also be seen clearly in Regime A of Figure 4(a). It can be understood in terms of Lee and Kuo's mechanism (1984): lower hybrid waves and low-frequency mode waves are excited simultaneously by whistler waves via an instability, and this instability is seen to saturate in the VTF experiments when the intensity of whistler waves is high. The two regimes in Figure 4(a) show the distinctive difference between Lee and Kuo's mechanism (1984) and Groves et al.'s mechanism (1988) in generating lower hybrid waves by whistler waves. The transition region seems to occur at the intensity of whistler waves exceeding 1 V/m, as seen in Figures 4(b) and 4(c): while the low-frequency mode waves are saturated, the lower hybrid waves are not. This is because the instability-excited low-frequency mode waves (Lee and Kuo, 1984) can scatter whistler waves into lower hybrid waves (Groves et al., 1988), which are superposed on the instability-excited lower hybrid waves.

Figure 4(a). The electric potential ($\phi_2$, in mV) of lower hybrid waves as a function of the product ($E_0\phi_1$) of whistler wave field intensity ($E_0$) and the electric potential ($\phi_1$) of low-frequency mode waves. Regimes (A) and (B) correspond to the conditions of high electric currents and low electric currents, respectively.

Figure 4(b). The electric potential ($\phi_2$, in mV) of lower hybrid waves as a function of the whistler wave field intensity ($E_0$, in mV/m).

(C)  Spectral  Broadening  of  Plasma  Waves 

The  presence  of  lower  hybrid  waves  can  broaden  the  spectra  of  high-frequency  plasma 
waves, for instance, the photoelectron-induced or HF heater-induced Langmuir waves (see, e.g., Kuo and Lee, 1992; Lee et al., 1997). The density fluctuations associated with the lower hybrid waves can nonlinearly scatter Langmuir waves to produce frequency-upshifted (i.e., anti-Stokes) and frequency-downshifted (i.e., Stokes) Langmuir waves, as observed in the ionospheric heating experiments at Arecibo, Puerto Rico (Lee et al., 1997) and proved theoretically by Kuo and Lee (1992).

It  should  be  pointed  out  that  frequency-upshifted  but  not  frequency-downshifted 
Langmuir waves were detected in the Arecibo experiments by a 430 MHz radar transmitting diagnostic waves vertically. Because the geomagnetic dip angle at Arecibo is about 40°, the radar only picks up Langmuir waves upgoing or downgoing at an angle of 50° with respect to the geomagnetic field. In the laboratory (VTF) experiments reported here, both the
frequency-upshifted  and  frequency-downshifted  Langmuir  waves  were  detected  by  a  dipole 
probe  along  the  magnetic  field.  How  we  can  simulate  the  modifications  of  ionospheric 
Langmuir  waves  caused  by  the  lightning-induced  lower  hybrid  waves  is  briefly  described 
below. 

The  electron  beam-produced  plasmas  in  the  VTF  have  a  density  profile,  as  shown  in 
Figure  2,  similar  to  the  ionospheric  F  region  near  the  F  peak.  Except  near  the  edge  of  the 
density profile, the ratio of electron plasma frequency to electron gyrofrequency can exceed unity under the condition of high electric currents, simulating the ionospheric plasma environment reasonably well. As mentioned in Section 2(B), under high electric currents, low-frequency mode waves and weak whistler waves are generated concurrently. The subsequent scattering of whistler waves off the low-frequency mode waves leads to the nonlinear mode conversion of whistler waves into lower hybrid waves (see Figure 3). However, no high-frequency plasma waves, such as Langmuir waves and upper hybrid
waves,  appear  inherently  in  the  VTF  plasmas  produced  by  intense  electron  beams. 

In  order  to  investigate  the  effect  of  lower  hybrid  waves  on  high-frequency  plasma 
waves,  an  intense  O-mode  microwave  is  injected  into  the  VTF  vacuum  chamber  from  a 
magnetron,  operated  at  the  frequency  of  2.45  GHz,  as  illustrated  in  Figure  5.  The  O-mode 
microwave  interacts  resonantly  with  the  VTF  plasmas  at  its  reflection  point  where  the 
wave  frequency  is  slightly  less  than  the  local  electron  plasma  frequency.  The  microwave- 
plasma  interaction  leads  to  the  decay  of  the  O-mode  microwave  into  a  Langmuir  wave  and 
an ion acoustic wave via the so-called parametric decay instability. The excited Langmuir wave can grow at the expense of the microwave and then decay into a daughter Langmuir wave and an ion acoustic wave. The successive decay of Langmuir waves yields a cascading
spectrum  of  Langmuir  waves  as  shown  in  Figures  6(a),  6(b),  and  6(c).  These  Langmuir 
waves  were  measured  by  a  dipole  probe  (see  Figure  5).  Shown  in  Figure  6(a)  is  the 
spectrum  of  Langmuir  waves  in  the  absence  of  lower  hybrid  waves  (D.T.  Moriarty,  Ph.D. 
Dissertation,  MIT,  1996),  which  is  distinctively  different  from  those  in  the  presence  of 
lower  hybrid  waves. 

Figure 6(b) is the spectrum of Langmuir waves measured by the dipole probe along the magnetic field, while Figure 6(c) is that at an angle of 20° with respect to the magnetic field. It is interesting to note that both the frequency-upshifted (i.e., anti-Stokes) and frequency-downshifted (i.e., Stokes) Langmuir waves were measured by the dipole probe along the magnetic field. By contrast, only the frequency-upshifted Langmuir waves were detected by the dipole probe at 20° across the magnetic field. As mentioned before, the Arecibo radar, transmitting signals at 50° with respect to the geomagnetic field during the ionospheric heating, could only detect the cascading and frequency-upshifted Langmuir waves, whose spectrum is closely analogous to that shown in Figure 6(c) (Lee et al., 1997).

Figure 5. Illustration of VTF experiments on microwave-excited Langmuir waves and their spectral modifications by lower hybrid waves.

When  the  radar  and  the  dipole  probe  are  tilted  at  an  angle  with  respect  to  the 
magnetic  field,  they  preferentially  pick  obliquely  propagating  Langmuir  waves.  Langmuir 
waves  propagating  along  the  magnetic  field  satisfy  the  dispersion  relation: 

$$\omega^2 = \omega_{pe}^2 + 3k^2 v_{te}^2,$$

where $\omega_{pe}$, $k$, and $v_{te}$ are the angular electron plasma frequency, wave number, and electron thermal velocity, respectively. Langmuir waves propagating obliquely across the magnetic field in ionospheric plasmas or VTF plasmas, by contrast, follow the dispersion relation

$$\omega^2 = \omega_{pe}^2 + 3k^2 v_{te}^2 + \Omega_e^2 \sin^2\theta,$$

where $\Omega_e$ and $\theta$ are the angular electron gyrofrequency and the wave propagation angle with respect to the magnetic field. Thus,
at  the  same  location,  Langmuir  waves  propagating  obliquely  have  larger  frequencies  than 
those  propagating  parallel  to  the  magnetic  field. 
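The comparison between the two dispersion relations can be checked numerically. The sketch below evaluates the wave frequency for parallel and oblique propagation at the same location (same plasma frequency, wave number, and temperature). The 2.45 GHz plasma frequency and 5 eV temperature echo values mentioned in the text, while the gyrofrequency and wave number are hypothetical values chosen only for illustration.

```python
import math

ELECTRON_MASS = 9.109e-31      # kg
ELEMENTARY_CHARGE = 1.602e-19  # C (also joules per electron-volt)


def thermal_velocity(te_ev):
    """Electron thermal velocity v_te = sqrt(k_B * T_e / m_e), with T_e in eV."""
    return math.sqrt(te_ev * ELEMENTARY_CHARGE / ELECTRON_MASS)


def langmuir_frequency(f_pe, f_ce, k, v_te, theta_deg=0.0):
    """Langmuir wave frequency in Hz from
    omega^2 = omega_pe^2 + 3 k^2 v_te^2 + Omega_e^2 sin^2(theta).

    theta_deg = 0 recovers the parallel-propagation relation.
    """
    omega_pe = 2.0 * math.pi * f_pe
    omega_ce = 2.0 * math.pi * f_ce
    sin_theta = math.sin(math.radians(theta_deg))
    omega_sq = omega_pe ** 2 + 3.0 * (k * v_te) ** 2 + (omega_ce * sin_theta) ** 2
    return math.sqrt(omega_sq) / (2.0 * math.pi)


v_te = thermal_velocity(5.0)   # T_e of about 5 eV, as quoted for the VTF
f_pe = 2.45e9                  # Hz, plasma frequency near the microwave reflection point
f_ce = 0.8e9                   # Hz, hypothetical gyrofrequency
k = 1.0e3                      # rad/m, hypothetical wave number

f_parallel = langmuir_frequency(f_pe, f_ce, k, v_te, theta_deg=0.0)
f_oblique = langmuir_frequency(f_pe, f_ce, k, v_te, theta_deg=20.0)
print(f"upshift at 20 degrees: {(f_oblique - f_parallel) / 1e6:.2f} MHz")
```

Because the additional term $\Omega_e^2 \sin^2\theta$ is never negative, the oblique frequency is always at least as large as the parallel one for the same $\omega_{pe}$, $k$, and $v_{te}$.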

When  parallel-propagating  Langmuir  waves  encounter  lower  hybrid  waves,  they  beat 
together  to  produce  Langmuir  waves  with  sum  frequencies  (viz.,  frequency-upshifted  or 
anti-Stokes  mode  waves)  and  difference  frequencies  (viz.,  frequency-downshifted  or  Stokes 
mode waves). If these beat waves propagate along the magnetic field, both the frequency-upshifted and frequency-downshifted Langmuir waves are “off-resonance” modes. They are equally likely to be detected, as we have demonstrated in the VTF experiments with a dipole probe oriented along the magnetic field. By contrast, if the beat waves propagate obliquely across the magnetic field, the frequency-upshifted (viz., anti-Stokes) Langmuir waves are “normal” or “eigen” modes, while the frequency-downshifted (viz., Stokes) Langmuir waves are “off-resonance” modes. Consequently, the intensities of anti-Stokes Langmuir waves are greater than those of Stokes Langmuir waves by several orders of magnitude, as shown in Kuo and Lee (1992). This explains why only frequency-upshifted Langmuir waves were detected by the Arecibo radar and by the dipole probe tilted off the magnetic field.




3.  Summary  and  Conclusions 


Laboratory experiments on the Versatile Toroidal Facility (VTF) have been conducted to investigate the ionospheric plasma effects produced by lightning-induced whistler waves. Prominent plasma effects result from the generation of lower hybrid waves by the lightning-induced whistler waves. Intense lower hybrid waves and zero-frequency mode waves can be excited by whistler waves simultaneously in a four-wave interaction process via an instability (Lee and Kuo, 1984). Relatively weak lower hybrid waves can be generated by whistler waves in the presence of short-scale field-aligned density irregularities. The scattering of whistler waves off density irregularities leads to the nonlinear mode conversion of whistler waves into lower hybrid waves (Groves et al., 1988). Lower hybrid waves
may  subsequently  interact  with  the  background  plasmas  to  cause  acceleration  of  electrons 
and  ions  along  and  across  the  magnetic  field,  respectively,  and  broaden  the  spectra  of 
background  plasma  waves  (Kuo  and  Lee,  1992;  Lee  et  al.,  1997). 

The  ionospheric  plasma  environment  can  be  reasonably  simulated  in  the  Versatile 
Toroidal  Facility  (VTF)  by  electron-beam  produced  plasmas.  The  electron  beam  moving 
along  the  helical  magnetic  field  produces  field-aligned  electric  currents.  Under  the  condition 
of  high  electric  currents,  intense  low-frequency  mode  waves  are  excited  by  the  current 
convective  instability  (Ossakow  and  Chaturvedi,  1979)  in  the  VTF  plasmas  which  have 
a  sharp  density  gradient.  Concurrently  generated  are  weak  whistler  waves,  which  are 
scattered  nonlinearly  by  the  intense  low-frequency  mode  waves  into  weak  lower  hybrid 
waves, in agreement with Groves et al.'s mechanism (1988). The scattering of whistler waves by ionospheric density irregularities, leading to the nonlinear mode conversion of whistler waves into lower hybrid waves, has been observed in recent rocket experiments on lightning-induced ionospheric effects (M.C. Kelley, personal communication, 1997).

Under  the  condition  of  low  electric  currents  in  the  VTF,  intense  whistler  waves  are 
generated,  but  intense  low-frequency  mode  waves  comparable  to  those  excited  by  the 
current  convective  instability  are  absent.  However,  intense  whistler  waves  can  excite  lower 
hybrid  waves  and  zero-frequency  mode  waves  simultaneously  as  discussed  in  Lee  and  Kuo 
(1984)  and  as  observed  in  the  ionosphere  (Berthelier  et  al.,  1982;  Liao  et  al.,  1989;  Sotnikov 
et  al.,  1991;  Dalkir  et  al.,  1992).  As  the  intensity  of  whistler  waves  increases,  the  excited 
lower hybrid waves and zero-frequency mode waves (having a Doppler frequency in the VTF) are seen in Figures 4(a) and 4(b) to saturate as expected. After the low-frequency mode waves are excited, they can scatter whistler waves and lead to the nonlinear mode conversion of whistler waves into lower hybrid waves (Groves et al., 1988). These lower hybrid waves, originating from the nonlinear mode conversion process, are superposed on those excited by the instability. Consequently, the saturation of the measured lower hybrid waves (see Figure 4(b)) is not as prominent as that of the low-frequency mode waves (see Figure 4(c)) in the VTF.

The  density  fluctuations  associated  with  the  lower  hybrid  waves  can  effectively  scatter 
high-frequency  ionospheric  plasma  waves,  e.g.,  the  photoelectron-induced  or  EM  wave- 
excited  Langmuir  waves,  and  thus,  modify  the  spectra  of  Langmuir  waves.  As  shown 
in  the  VTF  experiments  (see  Figure  6(a)),  Langmuir  waves  are  excited  by  an  O-mode 
microwave  via  the  parametric  decay  instability  (PDI).  The  PDI-excited  Langmuir  waves, 
propagating primarily parallel to the magnetic field, can beat with lower hybrid waves to produce frequency-upshifted (anti-Stokes) and frequency-downshifted (Stokes) Langmuir
waves  along  the  magnetic  field  (see  Figure  6(b)),  but  only  frequency-upshifted  Langmuir 
waves  across  the  magnetic  field  (see  Figure  6(c)).  For  parallel-propagating  Langmuir  waves, 
both  the  frequency-upshifted  (anti-Stokes)  and  frequency-downshifted  (Stokes)  Langmuir 
waves  are  off-resonance  modes,  which  have  equal  intensities.  By  contrast,  for  obliquely 
propagating  Langmuir  waves,  the  frequency-upshifted  Langmuir  waves  are  eigen  modes, 
while  the  frequency-downshifted  are  off-resonance  modes.  The  intensity  of  frequency- 
upshifted  Langmuir  waves  exceeds  that  of  frequency-downshifted  Langmuir  waves  by 
several  orders  of  magnitude  (Kuo  and  Lee,  1992).  This  explains  why  only  frequency- 
upshifted,  obliquely  propagating  Langmuir  waves  are  observed  in  the  VTF  experiments 
and  in  the  ionosphere  (Lee  et  al.,  1997). 
ABSTRACT

Microcontroller-based  feedback  algorithms  were  examined  for  applications  in  adaptive  structural  control.  The 
feedback  control  algorithms  consisted  of  tunable  second-order  filters  that  adaptively  suppressed  vibration  by 
tracking  the  frequency  of  resonant  modes.  Two  circuit  implementations  were  considered:  a  purely  digital 
design  implemented  on  a  microcontroller,  and  a  digitally-programmable  analog  design  implemented  using  a 
microcontroller and an analog active filter. The purely digital design, although requiring fewer components,
was  limited  by  the  speed  of  the  microcontroller  and  the  lack  of  floating  point  arithmetic.  An  initial  design 
consisting  of  one  second-order  filter  was  implemented  at  a  sampling  rate  of  2  kHz,  but  the  use  of  fixed- 
point  processing  limited  the  filter  frequency  to  above  40  Hz.  The  limitations  of  the  purely  digital  approach 
motivated  the  use  of  a  digitally-programmable  analog  filter  as  the  feedback  compensator.  Using  a  digitally- 
programmable  analog  filter  as  the  feedback  compensator  eliminated  problems  due  to  processor  speed  and 
fixed-point  arithmetic  while  maintaining  the  reconfigurability  of  the  algorithm.  A  frequency-tracking  filter 
was  implemented  using  a  frequency  identification  algorithm  implemented  with  a  phase-locked  loop  and  a 
microcontroller.  A  circuit  was  built  and  several  tests  were  performed  on  a  beam  with  a  variable  tip  mass. 
The  control  experiments  demonstrated  that  the  active  damping  achieved  with  a  fixed  filter  would  be  reduced 
by a factor of three as the natural frequency of the beam varied from 24 Hz to approximately 11 Hz, whereas
the  adaptive  filter  maintained  closed-loop  damping  ratios  between  14%  and  20%  over  the  same  frequency 
range.  Furthermore,  the  frequency-tracking  algorithm  was  able  to  adapt  to  vibration  in  approximately  500 
μs, making it a practical algorithm for transient vibration suppression.
INTRODUCTION


Active  damping  is  a  proven  solution  for  the  suppression  of  resonant  structural  vibration.  In  contrast  to 
passive  damping  methods  which  utilize  energy-dissipating  materials  to  suppress  vibration,  active  damping 
methods  utilize  feedback  control  systems  to  dissipate  structural  energy.  The  feedback  control  systems  consist 
of  vibration  sensors,  control  actuators,  and  electronic  circuits  that  are  designed  to  couple  to  the  structural 
vibration and dissipate energy in lightly damped resonant modes.

Active  damping  methods  are  practical  approaches  to  vibration  suppression  due  to  their  simplicity  and 
inherent  robustness  with  respect  to  uncertainty  in  the  structural  dynamics.  Early  work  in  the  control 
of  flexible  structures  proved  that,  in  theory,  collocated  velocity  feedback  is  unconditionally  stable  in  the 
presence  of  unmodelled  resonant  modes  [1].  Although  theoretically  valid,  velocity  feedback  suffers  from 
several  practical  problems  due  to  its  sensitivity  to  sensor-actuator  noncollocation  and  unmodelled  phase 
lags  [2]. 

More  practical  methods  of  active  damping  account  for  unmodelled  dynamics  at  the  expense  of  reduced 
performance.  Feedback  control  systems  consisting  of  tuned  resonant  filters  are  more  robust  with  respect  to 
unmodelled  dynamics,  but  typically  only  add  damping  to,  at  most,  a  few  structural  modes  [3].  Broadband 
active  damping  is  achieved  by  designing  multiple  control  loops  and  tuning  the  individual  filters  to  different 
frequency  ranges  [4]. 

To date, most of the basic theory for active damping has been developed, and more attention is now focused on the implementation of active structural control. Representative space structure experiments have demonstrated that low-order control systems have the ability to control structural vibration, and hardware has been developed to implement a wide variety of active damping control laws. Even more recently, an embedded
control  device  consisting  of  integrated  sensors,  actuators,  and  control  electronics  was  developed  for  active 
structural  control  applications  [5]. 

As  these  references  demonstrate,  much  of  the  recent  work  in  active  structural  control  has  focused  on 
implementing  microprocessor-based  structural  control  systems  that  are  easily  reconfigurable.  In  the  former 
work,  a  standalone  microprocessor-based  control  system  with  a  graphical  user  interface  was  developed  for 
implementing active control. Control designs were performed on commercially available software and downloaded directly to the microprocessor via a standard PC interface. In a similar fashion, the embedded control device developed by CSA Engineering, Inc., although not microprocessor based, could be reconfigured easily by downloading feedback control parameters directly from a personal computer. In both systems a major emphasis was placed on developing hardware that was easily changed through standard microprocessor
interfaces. 

The  majority  of  work  in  active  structural  control  has  focused  on  the  use  of  digital  signal  processors  for 
implementing  control  algorithms.  Digital  signal  processors  (DSPs)  are  the  microprocessor  of  choice  due  to 
the  fact  they  are  optimized  for  real-time  computation.  Not  only  do  DSPs  have  additional  hardware  that 
increases  the  computational  speed  of  the  processor,  their  programming  architecture  is  optimized  to  minimize 
the  amount  of  time  required  to  perform  floating  point  arithmetic.  These  two  features  give  DSPs  the  ability 
to  perform  accurate  real-time  computation  at  high  sampling  rates,  making  them  an  excellent  choice  for 
applications  in  active  structural  control. 

In  recent  years  the  increasing  power  of  DSPs  has  increased  the  practicality  of  active  control  algorithms. 
Currently,  state  of  the  art  DSPs  have  instruction  times  on  the  order  of  40  ns  and  can  perform  up  to  120 
million floating point operations per second. With the use of fast analog-to-digital and digital-to-analog
converters,  it  is  possible  to  implement  high-order,  multi-input-multi-output  control  laws  with  sampling  rates 
on  the  order  of  10  to  30  kHz. 

The increase in power of DSPs has been paralleled by an increase in power of another class of microprocessors known as microcontrollers. In contrast to DSPs, which are optimized for real-time computation, microcontrollers are low-end microprocessors that typically include on-board memory and a variety of peripherals.
Although  not  nearly  as  powerful  as  DSPs  -  instruction  times  are  on  the  order  of  100  to  200  ns  and  an  on-chip 
hardware  multiplier  is  typically  not  included  in  the  device  -  the  primary  advantages  of  microcontrollers  are 
their  cost  and  wide  availability.  Currently,  one-time  programmable  microcontrollers  are  approximately  an 
order  of  magnitude  less  expensive  than  low-end,  fixed-point  DSPs.  Furthermore,  microcontrollers  are  widely 
used  in  a  number  of  price  constrained  industries,  such  as  automotive  and  telecommunications,  making  them 
easily  available  and  very  cost  effective. 
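The trade-off between instruction time and sampling rate can be made concrete with a rough instruction budget per sample. The sketch below uses the instruction times quoted above (40 ns for a DSP, 100 to 200 ns for a microcontroller), the 30 kHz upper sampling rate quoted for DSP control laws, and the 2 kHz rate reported in the abstract for the purely digital design. It assumes every instruction takes one instruction time, which is an idealization; the function name is hypothetical.

```python
NANOSECONDS_PER_SECOND = 1_000_000_000


def instructions_per_sample(instruction_time_ns, sampling_rate_hz):
    """Upper bound on instructions executable within one sampling period.

    Uses integer arithmetic to avoid floating point rounding. Assumes each
    instruction takes exactly one instruction time (an idealization).
    """
    return NANOSECONDS_PER_SECOND // (instruction_time_ns * sampling_rate_hz)


# DSP at 40 ns per instruction, sampling at 30 kHz.
print(instructions_per_sample(40, 30_000))
# Microcontroller at 100 ns and 200 ns per instruction, sampling at 2 kHz.
print(instructions_per_sample(100, 2_000))
print(instructions_per_sample(200, 2_000))
```

The budget is only an upper bound. Without an on-chip hardware multiplier, each multiplication in a filter update consumes many instructions, so the number of filter operations a microcontroller can actually complete per sample is considerably smaller than the raw count suggests.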

The  increasing  power  of  cost  effective  microcontrollers  motivates  their  use  as  processors  for  active  damping 
applications.  Although  microcontrollers  are  not  suitable  for  high  performance  applications  that  require 
fast  sampling  rates,  it  is  possible  that  they  are  useful  for  implementing  active  damping  control  laws  for 
structural  vibration  suppression.  The  fact  that  active  damping  control  laws  are  typically  low  order  and  do 
not  require  excessively  high  sampling  rates  means  that  digital  signal  processors  might  not  be  required  for 
digital  implementation.  A  more  cost  effective  approach  might  be  the  use  of  microcontrollers  specifically 
designed  for  implementing  active  structural  damping. 

The objective of this work was to investigate the use of a certain class of microcontrollers for implementing active structural damping, with a particular emphasis on the use of adaptive structural control. The
first  section  of  this  report  discusses  the  theory  of  active  damping  and  develops  an  analytical  basis  for  the 
adaptive  control  algorithm.  The  second  section  examines  a  purely  digital  implementation  of  the  feedback 
compensator and discusses the limitations that arise due to the use of a microcontroller. The third section examines a hybrid approach to the problem in which the feedback compensator is implemented on a
digitally-programmable  analog  filter  and  the  frequency-tracking  algorithm  is  resident  on  a  microcontroller. 
The  final  section  compares  the  two  approaches  and  discusses  the  major  conclusions. 


Active  Damping  for  Structural  Vibration  Suppression 

An  examination  of  the  equations  of  motion  for  a  vibrating  structure  motivates  the  use  of  tuned  second-order 
filters  for  active  damping  applications.  Consider  a  structure  modelled  by  a  linear  set  of  matrix  equations, 

$$M\ddot{x} + D\dot{x} + Kx = B_w w + B_u u, \qquad (1)$$

where  M,  D,  and  K  are  the  mass,  damping,  and  stiffness  matrices  of  the  dynamic  system;  x  is  a  time- 
dependent  vector  of  physical  coordinates;  w  and  u  are  the  exogenous  and  control  forces,  respectively;  and 
Bw  and  Bu  are  the  influence  matrices  of  the  two  sets  of  forces. 

If the mass-normalized mode shapes are denoted $\psi$ and we assume that the damping matrix is uncoupled by the undamped mode shapes, we can diagonalize the system through the coordinate transformation

$$x = \psi\eta, \qquad (2)$$

where $\eta$ is the time-dependent vector of modal coordinates [7]. Substituting the previous expression into equation 1 and premultiplying by $\psi^T$ yields

$$I\ddot{\eta} + \Delta\dot{\eta} + \Lambda\eta = \psi^T B_w w + \psi^T B_u u, \qquad (3)$$

where $\Delta$ and $\Lambda$ denote the diagonal modal damping and modal stiffness matrices.

The outputs of the dynamic system are separated into the regulated variables, $z$, and feedback sensors, $y$. Assuming that the outputs are a linear combination of the physical states and the inputs, we have the set of equations

$$\begin{aligned} z &= C_{zd}x + C_{zv}\dot{x} + D_{zw}w + D_{zu}u \\ y &= C_{yd}x + C_{yv}\dot{x} + D_{yw}w + D_{yu}u, \end{aligned} \qquad (4)$$

which can be transformed into modal coordinates through equation 2:

$$\begin{aligned} z &= C_{zd}\psi\eta + C_{zv}\psi\dot{\eta} + D_{zw}w + D_{zu}u \\ y &= C_{yd}\psi\eta + C_{yv}\psi\dot{\eta} + D_{yw}w + D_{yu}u. \end{aligned} \qquad (5)$$

The open-loop transfer matrices for the dynamic system can be obtained by transforming equations 3 and 5 into the Laplace domain and solving for $z$ and $y$ in terms of the inputs $w$ and $u$. This results in a set of four equations which are denoted $P_{zw}(s)$, $P_{zu}(s)$, $P_{yw}(s)$, and $P_{yu}(s)$:

$$\begin{aligned} P_{zw}(s) &= C_{zd}A(s)B_w + sC_{zv}A(s)B_w + D_{zw} \\ P_{zu}(s) &= C_{zd}A(s)B_u + sC_{zv}A(s)B_u + D_{zu} \\ P_{yw}(s) &= C_{yd}A(s)B_w + sC_{yv}A(s)B_w + D_{yw} \\ P_{yu}(s) &= C_{yd}A(s)B_u + sC_{yv}A(s)B_u + D_{yu}. \end{aligned} \qquad (6)$$

The matrix $A(s)$ is defined as

$$A(s) = \psi\left(s^2 I + \Delta s + \Lambda\right)^{-1}\psi^T. \qquad (7)$$

The closed-loop system is obtained by implementing the linear feedback control law, $K(s)$:

$$u = K(s)y, \qquad (8)$$

which results in the following closed-loop transfer matrix from $w$ to $z$:

$$H(s) = P_{zw}(s) + P_{zu}(s)K(s)\left(I - P_{yu}(s)K(s)\right)^{-1}P_{yw}(s). \qquad (9)$$

Specific expressions in terms of the modal matrices and input-output matrices are obtained by substituting the expressions in equation 6 into equation 9.

The use of tuned second-order filters is motivated by the decoupled equations of motion. Equation 3 states that, under certain assumptions regarding the structural damping, the equations of motion can be decoupled into a set of second-order oscillators. This is a well known result in vibration theory, and has interesting implications for active structural control. It implies that we can examine the structural control problem on a mode-by-mode basis and, to a certain degree, gain insight into the problem by examining the control of a simple second-order oscillator. This is discussed in the following section.


Single-Mode  Control 

Consider the example of a single structural mode where the regulated variables and feedback sensors are simply the physical displacement of the mode. In a practical application, this would most likely be a simple model of a complex structure in which the structural mode of interest is well separated from the remaining flexible dynamics. In this case, the matrices $C_{zv}$ and $C_{yv}$, as well as all of the direct transmission matrices, are identically equal to zero. Furthermore, the remaining matrices reduce to

$$A(s) = \frac{\omega_s^2}{s^2 + 2\zeta_s\omega_s s + \omega_s^2}. \tag{10}$$


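Assuming that

$$B_w = a, \quad B_u = b, \quad C_{zd} = c, \quad C_{yd} = d, \tag{11}$$

where $a$, $b$, $c$, and $d$ are constants, the open-loop matrix partitions can be written as

$$P_{zw}(s) = \frac{ac}{\Delta_s(s)}, \quad P_{zu}(s) = \frac{bc}{\Delta_s(s)}, \quad P_{yw}(s) = \frac{ad}{\Delta_s(s)}, \quad P_{yu}(s) = \frac{bd}{\Delta_s(s)}, \tag{12}$$

where

$$\Delta_s(s) = \frac{s^2 + 2\zeta_s\omega_s s + \omega_s^2}{\omega_s^2} \tag{13}$$
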

and $\zeta_s$ and $\omega_s$ are the damping ratio and natural frequency, respectively, of the structural mode. If the control law, $K(s)$, is a positive position feedback filter expressed in terms of the feedback gain, $g$, and the filter damping and frequency, $\zeta_f$ and $\omega_f$:

$$K(s) = \frac{g^2\omega_f^2}{s^2 + 2\zeta_f\omega_f s + \omega_f^2} = \frac{g^2}{\Delta_f(s)}, \tag{14}$$

then the expression for the closed-loop system is reduced to

$$H(s) = ac\,\frac{\Delta_f(s)}{\Delta_s(s)\Delta_f(s) - b^2g^2}. \tag{15}$$



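Equation 15 illustrates that the closed-loop poles of the system are a function of the structural parameters, the filter parameters, and the open-loop dc gain. The gain $ac$ is irrelevant to the closed-loop damping since it simply multiplies the closed-loop transfer function. The previous expression can be nondimensionalized by substituting $s = \omega_s\sigma$, dividing through by $\omega_s^4$, replacing $b^2g^2$ by the nondimensional variable $\gamma^2$, and replacing $\omega_f/\omega_s$ by the nondimensional ratio $r_f$. This results in the following nondimensional expression

$$H(\sigma) = \frac{\sigma^2 + 2\zeta_f r_f\sigma + r_f^2}{\left(\sigma^2 + 2\zeta_f r_f\sigma + r_f^2\right)\left(\sigma^2 + 2\zeta_s\sigma + 1\right) - \gamma^2 r_f^2}, \tag{16}$$

which can be expanded into

$$H(\sigma) = \frac{\sigma^2 + 2\zeta_f r_f\sigma + r_f^2}{\sigma^4 + a_3\sigma^3 + a_2\sigma^2 + a_1\sigma + a_0}, \tag{17}$$

where

$$\begin{aligned} a_3 &= 2\zeta_f r_f + 2\zeta_s \\ a_2 &= r_f^2 + 1 + 4\zeta_f\zeta_s r_f \\ a_1 &= 2\zeta_f r_f + 2\zeta_s r_f^2 \\ a_0 &= r_f^2\left(1 - \gamma^2\right). \end{aligned} \tag{18}$$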
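The expansion of the nondimensional denominator can be checked numerically. The sketch below multiplies the filter and structural polynomials, subtracts the feedback term, and compares the result with the closed-form coefficients; the function names and the parameter values are illustrative assumptions, not values from the report.

```python
def poly_mul(p, q):
    """Multiply polynomials given as coefficient lists, highest power first."""
    out = [0.0] * (len(p) + len(q) - 1)
    for i, pi in enumerate(p):
        for j, qj in enumerate(q):
            out[i + j] += pi * qj
    return out

def closed_loop_denominator(zeta_f, r_f, zeta_s, gamma):
    """Coefficients [1, a3, a2, a1, a0] obtained by direct multiplication."""
    filt = [1.0, 2.0 * zeta_f * r_f, r_f ** 2]      # sigma^2 + 2 zeta_f r_f sigma + r_f^2
    struct = [1.0, 2.0 * zeta_s, 1.0]               # sigma^2 + 2 zeta_s sigma + 1
    den = poly_mul(filt, struct)
    den[-1] -= gamma ** 2 * r_f ** 2                # feedback term acts on sigma^0 only
    return den

def closed_form(zeta_f, r_f, zeta_s, gamma):
    """The same coefficients written out term by term."""
    return [1.0,
            2.0 * zeta_f * r_f + 2.0 * zeta_s,
            r_f ** 2 + 1.0 + 4.0 * zeta_f * zeta_s * r_f,
            2.0 * zeta_f * r_f + 2.0 * zeta_s * r_f ** 2,
            r_f ** 2 * (1.0 - gamma ** 2)]

# Illustrative values: zeta_f, r_f, zeta_s, gamma.
args = (0.3, 1.1, 0.01, 0.4)
numeric = closed_loop_denominator(*args)
symbolic = closed_form(*args)
```

Only the constant coefficient depends on the gain, and it reaches zero at $\gamma = 1$.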


Pole  Allocation  Approach  to  PPF  Design 

The objective of the PPF design is to choose the feedback filter parameters that achieve a desired level of closed-loop damping. As equation 17 demonstrates, the closed-loop system has a fourth-order characteristic equation, which is equivalent to two pairs of oscillatory poles when the roots of the polynomial are complex. One complex pair corresponds to the poles of the feedback filter, while the other pair of complex poles corresponds to the structural mode of vibration. For a given set of filter parameters, $\zeta_f$ and $\omega_f$, and structural parameters, $\zeta_s$ and $\omega_s$, the closed-loop poles are simply a function of the nondimensional gain $\gamma$.

One method of designing the feedback filter is to set the closed-loop characteristic equation equal to a set of desired closed-loop poles:

$$\left(\sigma^2 + 2\zeta_{cl}r_{cl}\sigma + r_{cl}^2\right)\left(\sigma^2 + 2\zeta_{cl}\alpha r_{cl}\sigma + \alpha^2 r_{cl}^2\right) = \left(\sigma^2 + 2\zeta_f r_f\sigma + r_f^2\right)\left(\sigma^2 + 2\zeta_s\sigma + 1\right) - \gamma^2 r_f^2, \tag{19}$$

where $\zeta_{cl}$ is the damping ratio of the closed-loop poles,

$$r_{cl} = \omega_{cl}/\omega_s, \tag{20}$$

$\alpha$ is a scalar constant, and $\omega_{cl}$ is the desired closed-loop natural frequency.

Figure 1: Open-loop filter ratio and closed-loop pole ratio as a function of the design parameters of a positive position feedback filter.


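If equation 19 is expanded, we can write a set of four equations, one for each power of $\sigma$, that must be satisfied for the closed-loop poles to be at the desired location:

$$\begin{aligned} \sigma^3:&\quad 2\zeta_{cl}r_{cl}(1 + \alpha) = 2\zeta_f r_f + 2\zeta_s \\ \sigma^2:&\quad r_{cl}^2\left(\alpha^2 + 1\right) + 4\zeta_{cl}^2\alpha r_{cl}^2 = r_f^2 + 1 + 4\zeta_f\zeta_s r_f \\ \sigma^1:&\quad 2\zeta_{cl}\alpha^2 r_{cl}^3 + 2\zeta_{cl}\alpha r_{cl}^3 = 2\zeta_f r_f + 2\zeta_s r_f^2 \\ \sigma^0:&\quad \alpha^2 r_{cl}^4 = r_f^2 - \gamma^2 r_f^2. \end{aligned} \tag{21}$$

Assuming that the structural damping is negligible ($\zeta_s \approx 0$), we can solve these four equations and write a set of design rules for the PPF filter: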
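One way to carry out this solution is sketched below. It assumes $\zeta_s = 0$ exactly and uses only the four coefficient-matching conditions above; the grouping and final form of the expressions are a reconstruction and are not guaranteed to match the numbered design rules as originally printed.

```latex
\begin{aligned}
&\text{Dividing the } \sigma^1 \text{ condition by the } \sigma^3 \text{ condition:}
  && \alpha r_{cl}^2 = 1
  \;\Rightarrow\; r_{cl} = \frac{1}{\sqrt{\alpha}} \\[4pt]
&\text{Using } \alpha^2 r_{cl}^4 = 1 \text{ in the } \sigma^0 \text{ condition:}
  && r_f^2\left(1 - \gamma^2\right) = 1
  \;\Rightarrow\; r_f = \frac{1}{\sqrt{1 - \gamma^2}} \\[4pt]
&\text{Substituting both into the } \sigma^2 \text{ condition:}
  && 4\zeta_{cl}^2 = \frac{\gamma^2}{1 - \gamma^2} - \frac{(1 - \alpha)^2}{\alpha}
  \;\Rightarrow\; \zeta_{cl} = \frac{1}{2}\sqrt{\frac{\gamma^2}{1 - \gamma^2} - \frac{(1 - \alpha)^2}{\alpha}} \\[4pt]
&\text{Solving the } \sigma^3 \text{ condition for } \zeta_f\text{:}
  && \zeta_f = \zeta_{cl}\,\frac{(1 + \alpha)\sqrt{1 - \gamma^2}}{\sqrt{\alpha}}
\end{aligned}
```

For $\alpha = 1$ these reduce to $r_{cl} = 1$, $\zeta_{cl} = \gamma / \left(2\sqrt{1 - \gamma^2}\right)$, and $\zeta_f = 2\zeta_{cl}\sqrt{1 - \gamma^2} = \gamma$, so the filter damping ratio is close to twice the closed-loop damping ratio when $\gamma$ is small.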


Equations 22 through 25 can be used to choose the filter parameters as a function of the nondimensional gain, $\gamma$, and the ratio of the closed-loop natural frequencies, $\alpha$. Equations 22 and 23 demonstrate that the natural frequency ratios are simply a function of either $\alpha$ or $\gamma$, while the closed-loop damping ratios are a function of both the nondimensional gain and the ratio of closed-loop natural frequencies. Referring to Figure 1, we see that the ratio of closed-loop poles, $r_{cl}$, is simply equal to 1 if $\alpha = 1$. Also, the figure indicates that the open-loop filter ratio, $\omega_f/\omega_s$, that achieves the desired closed-loop pole locations has an asymptote of 1 as the control gain decreases towards zero.

The achievable closed-loop damping ratio is a more complicated function of the design parameters $\alpha$ and $\gamma$. Examining equation 24, we see that the closed-loop damping ratio is a function of both the nondimensional gain and the ratio of closed-loop natural frequencies. For a particular value of $\alpha$ we can plot the variation of $\zeta_{cl}$ as a function of the nondimensional gain and determine the achievable performance.

Consider the case plotted in Figure 2 for $\alpha = 1$. For this case, nondimensional gains less than approximately 0.5 yield a linear variation in the closed-loop damping ratio. As the gain becomes larger than 0.5, the closed-loop damping ratio increases more rapidly with respect to $\gamma$ than in the region less than 0.5. Also plotted in Figure 2 is the filter damping ratio required to achieve the desired closed-loop damping. As plotted in the figure, and highlighted in equation 25, the required damping ratio is approximately equal to twice the desired closed-loop damping ratio in the low-gain region ($\gamma < 0.5$).

Figure 2: Closed-loop damping ratio, $\zeta_{cl}$, and required filter damping ratio, $\zeta_f$, as a function of the nondimensional gain for $\alpha = 1$.
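The pole allocation can be exercised numerically with a short script. The closed-form expressions in `ppf_design` are an assumption: they are what one obtains by solving the four coefficient-matching conditions with zero structural damping, and they are not transcribed from the numbered design equations. The function `residuals` checks them against those four conditions directly.

```python
import math

def ppf_design(gamma, alpha=1.0):
    """Return (r_cl, r_f, zeta_cl, zeta_f) for zero structural damping (assumed forms)."""
    if not 0.0 < gamma < 1.0:
        raise ValueError("gamma must lie in (0, 1)")
    radicand = gamma ** 2 / (1.0 - gamma ** 2) - (1.0 - alpha) ** 2 / alpha
    if radicand < 0.0:
        raise ValueError("no real closed-loop damping ratio for this gamma/alpha pair")
    r_cl = 1.0 / math.sqrt(alpha)
    r_f = 1.0 / math.sqrt(1.0 - gamma ** 2)
    zeta_cl = 0.5 * math.sqrt(radicand)
    zeta_f = zeta_cl * (1.0 + alpha) * math.sqrt(1.0 - gamma ** 2) / math.sqrt(alpha)
    return r_cl, r_f, zeta_cl, zeta_f

def residuals(gamma, alpha, r_cl, r_f, zeta_cl, zeta_f, zeta_s=0.0):
    """Left side minus right side of the sigma^3 ... sigma^0 matching conditions."""
    return [
        2 * zeta_cl * r_cl * (1 + alpha) - (2 * zeta_f * r_f + 2 * zeta_s),
        r_cl ** 2 * (alpha ** 2 + 1) + 4 * zeta_cl ** 2 * alpha * r_cl ** 2
        - (r_f ** 2 + 1 + 4 * zeta_f * zeta_s * r_f),
        2 * zeta_cl * alpha ** 2 * r_cl ** 3 + 2 * zeta_cl * alpha * r_cl ** 3
        - (2 * zeta_f * r_f + 2 * zeta_s * r_f ** 2),
        alpha ** 2 * r_cl ** 4 - (r_f ** 2 - gamma ** 2 * r_f ** 2),
    ]

# Illustrative low-gain design with equal closed-loop natural frequencies.
gamma, alpha = 0.4, 1.0
design = ppf_design(gamma, alpha)
errors = residuals(gamma, alpha, *design)
```

With these inputs the required filter damping ratio equals $\gamma$ and the closed-loop damping ratio is slightly more than half of it, in line with the low-gain behaviour described for Figure 2.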

Combining  the  information  from  Figures  1  and  2  yields  a  set  of  practical  rules  for  the  design  of  PPF 
compensators: 

1.  For  a  gain  margin  of  greater  than  2.0  (6  dB),  the  desired  closed-loop  damping  ratio  is  achieved  by 
setting  the  natural  frequency  of  the  filter  approximately  equal  to  the  frequency  of  the  target  structural 
mode. 

2. When the nondimensional gain is less than 0.5 and $\alpha = 1$, the closed-loop damping in the structural mode is approximately equal to $0.6\gamma$ and the required damping ratio of the feedback compensator is approximately equal to twice the desired closed-loop damping.

These  design  rules  trade  off  performance  in  the  interest  of  increasing  robustness.  Figure  2  illustrates  that  a 
larger  amount  of  closed-loop  damping  is  achievable  if  the  gain  of  the  feedback  compensator  is  increased.  Of 
course,  this  increase  in  observable  performance  occurs  at  the  expense  of  gain  margin  in  the  control  system. 
Reducing  the  gain  margin  at  low  frequency  increases  the  likelihood  that  changes  in  the  DC  gain  of  the 
structural  response  will  lead  to  closed-loop  instability.  Keeping  the  gain  margin  greater  than  2.0  (or  6  dB) 
means  that  the  DC  gain  of  the  structural  response  can  change  by  up  to  a  factor  of  two  before  closed-loop 
instability  will  occur. 


Digital  Filter  Implementation 

The  first  half  of  the  work  concentrated  on  a  digital  implementation  of  the  positive  position  feedback  filter. 
The  primary  objective  of  the  work  was  to  determine  the  following  limitations  on  the  use  of  the  microcontroller: 

•  Maximum  sampling  rate  of  the  digital  filter, 

•  Maximum  number  of  second-order  filters  that  could  be  implemented  on  the  microcontroller, 

•  The  limits  on  filter  accuracy  using  fixed-point  arithmetic. 

The digital filter implementation of the vibration suppression algorithm consisted of three components: an LTC 1298 12-bit analog-to-digital converter (ADC), a PIC16C65A microcontroller, and a 12-bit LTC 1451 digital-to-analog converter (DAC). Code for interfacing with the LTC 1298 was already available, and custom code was written to interface with the LTC 1451. Code was developed on the PC and downloaded to the microcontroller via the PicStart Plus hardware.

Table 1: Measured operation times for implementing a second-order IIR filter on the PIC16C65A with a 20 MHz clock speed

Operation        Number of Instructions   Time @ 20 MHz (µs)
Read ADC         637                      127.4
Compute output   1307                     261.3
Output to DAC    555                      111.0
Total                                     499.7

Code was developed to implement a second-order Infinite Impulse Response (IIR) filter using the finite difference equations:

$$\begin{aligned} d(k) &= u(k) - a_1 d(k-1) - a_2 d(k-2) \\ y(k) &= b_0 d(k) + b_1 d(k-1) + b_2 d(k-2), \end{aligned} \tag{26}$$

where $u(k)$ and $y(k)$ are the filter input and output, respectively, at the $k$th time step; $d(k)$ is an intermediate variable; and $a_1$, $a_2$, $b_0$, $b_1$, and $b_2$ are the coefficients that define the frequency response of the filter. Three microcontroller operations were required to implement the IIR filter:

1.  Read  a  value  from  the  ADC 

2.  Solve  finite  difference  equation  (equation  26)  to  compute  filter  output 

3.  Output  value  to  DAC 

The  speed  and  accuracy  at  which  the  microcontroller  performed  each  of  these  steps  determined  the  sampling 
rate  and  frequency  limitations  of  the  filter. 
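The filter update itself is compact. The following floating-point sketch implements the two difference equations with an intermediate state; it is not the microcontroller code, and the class name and example coefficients are illustrative assumptions chosen to give a stable filter with unity dc gain.

```python
class SecondOrderIIR:
    """Second-order IIR section driven by the intermediate variable d(k)."""

    def __init__(self, a1, a2, b0, b1, b2):
        self.a1, self.a2 = a1, a2
        self.b0, self.b1, self.b2 = b0, b1, b2
        self.d1 = 0.0  # d(k-1)
        self.d2 = 0.0  # d(k-2)

    def step(self, u):
        """Advance one sample: read input u(k), return output y(k)."""
        d = u - self.a1 * self.d1 - self.a2 * self.d2
        y = self.b0 * d + self.b1 * self.d1 + self.b2 * self.d2
        self.d2, self.d1 = self.d1, d   # shift the delay line
        return y

# Illustrative coefficients (not from the report): poles at 0.5 +/- 0.5j, dc gain 1.
COEFFS = dict(a1=-1.0, a2=0.5, b0=0.125, b1=0.25, b2=0.125)

filt = SecondOrderIIR(**COEFFS)
impulse_response = [filt.step(1.0 if k == 0 else 0.0) for k in range(3)]

filt = SecondOrderIIR(**COEFFS)
step_response = [filt.step(1.0) for _ in range(200)]
```

Each call to `step` corresponds to the "compute output" operation; on the microcontroller it is bracketed by the ADC read and the DAC write.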

Maximum  Sampling  Rate 

The sampling rate of the digital filter was set by the speed at which the microcontroller could read the ADC, solve the finite difference equation, and output a value to the DAC. Table 1 lists the measured times the PIC microcontroller required to perform each of these operations. As Table 1 indicates, the total time to complete all of the operations was approximately 500 µs, setting the filter sampling rate at 2 kHz. Approximately 52% of the computation time was dedicated to computing the filter output, while the remaining time was spent reading and writing to the analog I/O.

There  are  two  ways  to  increase  the  sampling  rate  of  the  digital  filter:  replace  the  current  ADC  and  DAC 
hardware  and  optimize  the  code  that  performs  the  finite  difference  equation.  The  largest  gains  could  be 
obtained  by  changing  the  hardware  for  the  A/D  and  D/A  conversions.  Both  the  LTC  1298  and  LTC  1451 
have  12-bit  serial  interfaces,  which  reduces  the  number  of  I/O  lines  required  to  interface  with  them  -  3  in 
the  case  of  the  ADC  and  4  in  the  case  of  the  DAC  -  but  significantly  increases  the  time  required  to  transfer 
data  to  and  from  the  processor. 

A much faster option would be to replace the serial I/O with a parallel interface. Parallel communication requires a larger number of I/O ports, but requires much less time due to the fact that the data transfer between the microprocessor and ADC/DAC takes place almost simultaneously. Although no timing studies were performed to directly compare a serial interface with a parallel interface, it is probable that the time required for reading and writing to the analog I/O would be reduced by a factor of 5 to 10: from roughly 250 µs to between 50 and 25 µs. Thus, simply replacing the serial interface with a parallel interface could increase the sampling rate from 2 kHz to between 3.33 and 3.63 kHz, although it would also increase the number of required digital I/O lines from 7 to between 25 and 30.

Table 2: Estimated computation time and sampling rate for an optimized filter implementation.

Operation             Current code   Optimized code   Optimized time    Optimized time
                      (cycles)       (cycles)         at 20 MHz (µs)    at 33 MHz (µs)
Read ADC              637            60               12                7.3
Compute output        1307           800              160               97
Write to DAC          555            60               12                7.3
Total time (µs)                                       184               111
Sampling rate (kHz)                                   5.44              9.01

Another way of increasing the sampling rate would be to optimize the code that solves the finite difference equation of the filter and computes the output signal. The current generation of code was compiled from a C program and downloaded into the microcontroller. It is probable that the program could be optimized by working directly with the assembly code, possibly by handcoding loops and reducing the amount of overhead associated with reading and writing intermediate variables.

The gains associated with code optimization are estimated to be between 25% and 50%, much less than those obtained by replacing the current suite of digital-to-analog I/O. That estimate is based on results published in the Embedded Control Handbook, in which a second-order IIR filter was implemented in approximately 800 instruction cycles by thoroughly optimizing the assembly code [8]. Assuming a 20 MHz clock frequency and a 200 ns instruction time, the time required to solve the finite difference equation would be 160 µs, and the computation alone would limit the sampling rate of the filter to 6.25 kHz. These estimates are summarized in Table 2.

The final method of increasing the sampling rate is, of course, to increase the clock speed of the processor.
Although  the  fastest  processors  presently  available  from  Microchip  were  used  for  these  studies  (20  MHz), 
Microchip  is  currently  developing  a  33  MHz  version  of  their  PIC16CXX  line  of  microcontrollers.  Upgrading 
to  a  33  MHz  clock  speed  would  result  in  a  65%  increase  in  speed  due  to  the  fact  that  the  instruction  cycle 
time  would  be  reduced  from  200  ns  to  121  ns.  An  estimate  of  the  computation  time  and  sampling  rate  of  a 
second-order  filter  running  on  a  33  MHz  processor  is  also  included  in  Table  2. 
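The conversion from instruction cycles to loop time is simple arithmetic. The sketch below assumes four clock periods per instruction cycle, which is what the quoted 200 ns and 121 ns cycle times imply at 20 MHz and 33 MHz; the function names are illustrative.

```python
def instruction_time_us(clock_mhz, clocks_per_instruction=4):
    """Instruction cycle time in microseconds (assumes 4 clock periods per cycle)."""
    return clocks_per_instruction / clock_mhz

def loop_time_us(cycle_counts, clock_mhz):
    """Time for one pass through the sample loop, in microseconds."""
    return sum(cycle_counts) * instruction_time_us(clock_mhz)

# Estimated optimized cycle counts: read ADC, compute output, write DAC.
optimized_cycles = [60, 800, 60]

for clock in (20.0, 33.0):
    t = loop_time_us(optimized_cycles, clock)
    print(f"{clock:.0f} MHz: {t:.1f} us per sample, {1.0e3 / t:.2f} kHz")
```

Small differences from the tabulated totals arise from rounding of the individual entries.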

Maximum  Filter  Order 

The maximum number of filter sections that could be implemented on the PIC microcontroller was interrelated with the maximum sampling rate of the digital filter. If we assume that the digital filter is simply a parallel implementation of N second-order sections and the fixed overhead required for the multiple sections is identical to that required for one section, then it is relatively straightforward to determine an expression for the time required to solve N finite difference equations:

Current code: 238.4 + 261 × (Number of filter sections) µs
Optimized using a 20 MHz clock (estimate): 24 + 160 × (Number of filter sections) µs
Optimized using a 33 MHz clock (estimate): 14.6 + 97 × (Number of filter sections) µs

The sampling rate of the digital filter can be plotted as a function of the section order to compare the performance of different implementations. Figure 3 demonstrates that, as expected, increasing the filter order decreases the achievable sampling rate. A rule of thumb for digital filter design is that the filter dynamics should be no higher than roughly 1/10 of the filter sampling rate. With this rule in mind, Figure 3 can be used to approximate the highest filter frequency that can be achieved with the different implementations: simply divide the sampling rate by 10 and that is a rough estimate of the maximum achievable filter frequency.

Figure 3: Achievable sampling rate of the digital filter as a function of section order (left) and required dynamic range of the filter coefficients (right).
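The timing expressions and the 1/10 rule of thumb combine into a short calculation. In the sketch below the overhead and per-section times are the values quoted in the text; the dictionary layout and function names are illustrative assumptions.

```python
# (fixed I/O overhead in us, time per second-order section in us)
IMPLEMENTATIONS = {
    "current code": (238.4, 261.0),
    "optimized, 20 MHz (estimate)": (24.0, 160.0),
    "optimized, 33 MHz (estimate)": (14.6, 97.0),
}

def sampling_rate_hz(overhead_us, per_section_us, n_sections):
    """Achievable sampling rate when every section is solved once per sample."""
    return 1.0e6 / (overhead_us + per_section_us * n_sections)

def max_filter_frequency_hz(overhead_us, per_section_us, n_sections, ratio=10.0):
    """Rule-of-thumb upper bound: filter dynamics at most 1/ratio of the sampling rate."""
    return sampling_rate_hz(overhead_us, per_section_us, n_sections) / ratio

for name, (overhead, per_section) in IMPLEMENTATIONS.items():
    for n in (1, 2, 4):
        fs = sampling_rate_hz(overhead, per_section, n)
        f_max = max_filter_frequency_hz(overhead, per_section, n)
        print(f"{name}, N={n}: fs = {fs:.0f} Hz, max filter frequency ~ {f_max:.0f} Hz")
```

For a single section running the current code this gives a sampling rate of about 2 kHz and a usable filter frequency of roughly 200 Hz.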


Limitations  due  to  Fixed-Point  Arithmetic 


The  previous  two  sections  discussed  the  limitations  on  the  filter  sampling  rate  as  a  function  of  the  code 
implementation  and  the  section  order.  Another  limitation  was  imposed  on  the  digital  filter  implementation 
due  to  the  fact  that  the  PIC  microcontroller  is  not  optimized  to  perform  floating  point  arithmetic  operations. 
The  lack  of  a  fast  floating  point  processor  -  like  those  found  in  DSPs  -  requires  the  use  of  fixed-point 
mathematics  in  the  real-time  solution  of  the  finite  difference  equation.  The  interesting  problem  with  fixed- 
point  arithmetic  is  that,  unlike  the  limitations  due  to  code  optimization  and  section  order,  the  constraints 
limit  the  lowest  frequency  achievable  with  the  digital  filter. 

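The performance limitations due to fixed-point processing were studied by discretizing the expression for the continuous-time filter (equation 14). Applying the Tustin transformation [6],

$$s = \frac{2}{T}\,\frac{z - 1}{z + 1}, \tag{27}$$

where $T$ is the sample time of the filter, to the continuous-time representation of the compensator yields the z-domain transfer function of the discretized filter:

$$K(z) = \frac{b_0 + b_1 z^{-1} + b_2 z^{-2}}{1 + a_1 z^{-1} + a_2 z^{-2}}, \tag{28}$$

where the coefficients are expressed in terms of $\zeta_f$ and $\phi = 2\pi f_f T$:

$$\begin{aligned} a_1 &= \frac{2\phi^2 - 8}{4 + 4\zeta_f\phi + \phi^2} \\ a_2 &= \frac{4 - 4\zeta_f\phi + \phi^2}{4 + 4\zeta_f\phi + \phi^2} \\ b_0 &= \frac{\phi^2}{4 + 4\zeta_f\phi + \phi^2} \\ b_1 &= 2b_0 = \frac{2\phi^2}{4 + 4\zeta_f\phi + \phi^2} \\ b_2 &= b_0 = \frac{\phi^2}{4 + 4\zeta_f\phi + \phi^2}. \end{aligned} \tag{29}$$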

For a filter with a specified damping ratio, the only parameter that affects the value of the coefficients is the ratio of the filter frequency to the sampling frequency. Figure 3 is a plot of the ratio of the maximum value of the coefficients to the minimum value over a range of $f_f T$ for $\zeta_f = 1$. The spike in the dynamic range is caused by the fact that one of the coefficients passes through zero and changes sign, thus causing a division by zero error. The spike is irrelevant, though, due to the fact it occurs at a ratio greater than 1/10.

Figure 4: Digital filter implementation (solid) and the frequency response of the continuous-time representation (dashed).

The important part of the plot is the region where the ratio of filter frequency to sampling frequency is less than 1/10. As Figure 3 indicates, the required dynamic range in the filter coefficients increases with a slope of 40 dB/decade on a log-log plot as $f_f T$ decreases. Thus, reducing $f_f T$ by a factor of 10 increases the required dynamic range by a factor of 100.

Also plotted in Figure 3 is the maximum dynamic range achievable with 8-bit and 16-bit arithmetic. Due to the fact that the dynamic range requirements increase as $f_f T$ decreases, a hard constraint on dynamic range limits the lowest frequency that is achievable with the discretized filter. In the case of 8-bit arithmetic, which has a dynamic range of 256:1, the limit is $f_f T \approx 0.03$, and in the case of 16-bit arithmetic, which has a dynamic range of 65536:1, the limit is $f_f T \approx 0.002$. Combining this lower limit with the higher limit imposed by sampling frequency yields bounds on the achievable filter frequency for 8-bit and 16-bit fixed-point arithmetic.
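The dynamic-range argument can be reproduced with a few lines of code. The sketch below evaluates the Tustin coefficients of a unity-dc-gain second-order filter and the ratio of the largest to the smallest coefficient magnitude. The small-$\phi$ estimate in `lowest_ratio_small_phi` is an added approximation (for small $\phi$, $|a_1| \to 2$ and $b_0 \to \phi^2/4$, so the range is about $8/\phi^2$), and all function names are illustrative.

```python
import math

def tustin_ppf_coefficients(ff_T, zeta_f):
    """Return (a1, a2, b0, b1, b2) for a given ratio ff_T = f_f * T and damping ratio."""
    phi = 2.0 * math.pi * ff_T
    den = 4.0 + 4.0 * zeta_f * phi + phi ** 2
    a1 = (2.0 * phi ** 2 - 8.0) / den
    a2 = (4.0 - 4.0 * zeta_f * phi + phi ** 2) / den
    b0 = phi ** 2 / den
    return a1, a2, b0, 2.0 * b0, b0

def dynamic_range(ff_T, zeta_f):
    """Ratio of the largest to the smallest coefficient magnitude.

    Raises ZeroDivisionError where a coefficient is exactly zero, which is
    the source of the spike in the plotted dynamic range.
    """
    mags = [abs(c) for c in tustin_ppf_coefficients(ff_T, zeta_f)]
    return max(mags) / min(mags)

def lowest_ratio_small_phi(max_range):
    """Approximate lowest usable f_f * T for a given dynamic range (small-phi estimate)."""
    return math.sqrt(8.0 / max_range) / (2.0 * math.pi)

for bits in (8, 16):
    limit = lowest_ratio_small_phi(2 ** bits)
    print(f"{bits}-bit: f_f T >= about {limit:.4f}")
```

The estimates come out close to the limits read from Figure 3 for 8-bit and 16-bit arithmetic.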


Digital  Filter  Implementation 

Several digital filters were implemented on the PIC16C65A to determine the performance of the microcontroller. Code was downloaded onto the microcontroller and the input-output frequency response was obtained with a spectrum analyzer. Figure 4 is a magnitude and phase plot of a digital filter implemented with a DC gain of 1, a 100 Hz filter frequency, and a damping ratio of 0.10; also overlaid on the plot is the frequency response of the continuous-time filter with the same parameters. The digital filter was implemented with a sampling rate of 2 kHz.

The measurements illustrate that at $f_f T = 0.05$, the digital implementation is able to accurately match the magnitude of the continuous-time filter, but significant lags exist in the phase response. The magnitude is matched well across the whole frequency range; the only significant differences occur at approximately 1 kHz, or half the sampling rate. Significant differences occur in the phase response due to delays in the zero-order hold of the DACs and processing delays in the microcontroller.
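A rough estimate of the size of these phase lags follows from treating each delay as a pure time delay. The sketch below assumes the common half-sample approximation for a zero-order hold and, as a further assumption, a processing delay of one full sample period; neither figure is measured in the report.

```python
def delay_phase_lag_deg(frequency_hz, delay_s):
    """Phase lag in degrees introduced by a pure time delay at a given frequency."""
    return 360.0 * frequency_hz * delay_s

sample_period = 1.0 / 2000.0        # 2 kHz sampling rate
zoh_delay = sample_period / 2.0     # half-sample approximation of the zero-order hold
processing_delay = sample_period    # assumption: output lags the input by one sample

# Estimated lag at the 100 Hz filter frequency.
lag_at_filter_frequency = delay_phase_lag_deg(100.0, zoh_delay + processing_delay)
```

Under these assumptions the lag grows linearly with frequency, so it is small well below the filter frequency and large near half the sampling rate.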

The errors that occurred due to fixed-point processing were studied by implementing several filters with varying center frequencies. Figure 5 is a magnitude plot of several different digital filter implementations, each with a different center frequency. Although all of the DC gains were set equal to 1 in the discretized transfer function, the inability to accurately represent the continuous-time filter caused magnitude errors at low frequencies. The problem became more pronounced as $f_f T$ was decreased to approximately 0.02, or a filter frequency of 40 Hz. For this filter, the DC gain error was approximately 20% due to the fact that the fixed-point arithmetic did not have the dynamic range to accurately represent the discrete filter coefficients.

Figure 5: Several digital filter implementations demonstrating the errors that occur at low frequency due to fixed-point arithmetic.


Digitally-Programmable  Analog  Implementation 
of  the  Adaptive  Filter 

The limitations imposed by digital sampling rate and fixed-point processing motivated the use of a ‘hybrid’ approach to the adaptive filter implementation. In the hybrid approach, the PPF compensator was implemented on a digitally-programmable analog filter and the microcontroller acted as a ‘supervisor’ circuit that determined the filter frequency and set the filter parameters. Unlike the previous implementation, which had no method of determining the settings for the feedback filter, the hybrid implementation also incorporated a phase-locked loop to measure the natural frequency of the vibrating structure.

A  block  diagram  of  the  digitally-programmable  analog  implementation  is  shown  in  Figure  6.  The  circuit 
consists  of  four  main  elements: 

1. Phase-Locked Loop. The vibration of the beam was measured using a structural sensor and input to the phase-locked loop to produce a square wave that had the same frequency as the beam vibration.

2. Microcontroller with a Mechanical Trigger. The microcontroller measured the output of the phase-locked loop and determined the correct clock signal frequency for the digitally-programmable analog filter. A mechanical trigger was implemented so that the microcontroller could be reset by simply holding the beam back for approximately two seconds and releasing it.

3. Digitally-Programmable Analog Filter. The positive position feedback compensator was implemented on a switched-capacitor filter that could be reprogrammed with the microcontroller. An analog smoothing filter was placed in series behind the digitally-programmable filter to reduce audible noise due to discrete steps in the filter output.

4. Bridge Amplifier. The actuators were driven with a simple bridge amp implemented on a standard operational amplifier.

Each of these components will be discussed in more detail in the following subsections.

Figure 6: Block diagram of the digitally-programmable analog implementation of the adaptive filter.

Phase-Locked  Loop 

The  main  function  of  the  phase-locked  loop  was  to  track  the  motion  of  the  beam  and  output  a  logic  signal 
of  the  same  frequency  as  the  natural  frequency  of  the  beam.  The  two  primary  reasons  to  use  a  phase-locked 
loop  to  perform  this  function  were: 

1.  The  phase-locked  loop  can  track  a  frequency  that  is  heavily  buried  in  noise,  even  when  the  signal  to 
noise  ratio  is  less  than  or  equal  to  one.  This  ‘noise’  can  be  actual  random  noise  in  the  circuit,  or  the 
‘noise’  could  actually  be  other  structural  modes  of  vibration. 

2.  The  phase-locked  loop  outputs  a  square  wave  signal  that  can  be  easily  measured  by  the  microcontroller 
for  the  purpose  of  setting  the  correct  parameters  of  the  feedback  filter. 

The phase-locked loop is itself a feedback circuit that has three main components: a signal multiplier that outputs a square wave that has a duty cycle proportional to the difference in the frequency of the two input signals, an analog filter, and a voltage-controlled oscillator (VCO) that outputs a square wave whose frequency is proportional to the dc input. Many types of phase-locked loops are commercially available, but all the different types can essentially be categorized into analog or digital phase-locked loops, where the type of phase-locked loop refers to the type of signal multiplier that is used in the feedback circuit.

Although  a  great  number  of  phase-locked  loops  exist,  the  principles  on  which  they  operate  are  essentially 
the  same.  The  reference  signal  is  compared  to  the  output  of  the  voltage-controlled  oscillator  through  what 
amounts  to  an  analog  multiplication.  The  output  of  the  signal  multiplier  has  two  components:  a  dc  offset 
that  is  proportional  to  the  phase  difference  between  the  two  signals,  and  an  oscillating  component  whose 
frequency  is  twice  the  frequency  of  the  input  signals.  The  multiplier  output  is  passed  through  a  lowpass 
analog  filter  that  reduces  the  oscillating  component  and  passes  the  dc  component  of  the  signal.  The  dc  signal 
then  sets  the  oscillator  frequency  of  the  VCO.  The  feedback  loop  minimizes  the  error  between  the  phase  of 
the  reference  signal  and  the  VCO  output. 
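The behaviour of the multiplier stage can be illustrated numerically. The sketch below assumes ideal sinusoidal signals and an ideal analog multiplier, and it stands in for the lowpass filter by averaging over a whole number of cycles; the digital phase detector used in the hardware works on logic signals instead, so this is only a conceptual model.

```python
import math

def multiplier_dc_component(phase_offset_rad, cycles=5, samples=1000):
    """Average of sin(wt) * sin(wt + phase) over a whole number of cycles.

    The product equals 0.5*cos(phase) - 0.5*cos(2wt + phase); averaging removes
    the double-frequency term, much as the lowpass filter does in the loop.
    """
    total = 0.0
    for k in range(samples):
        wt = 2.0 * math.pi * cycles * k / samples
        total += math.sin(wt) * math.sin(wt + phase_offset_rad)
    return total / samples

for degrees in (0, 60, 90, 120):
    dc = multiplier_dc_component(math.radians(degrees))
    print(f"phase offset {degrees:3d} deg -> dc component {dc:+.3f}")
```

Near a 90 degree offset the dc component varies almost linearly with the phase error, which is the regime in which the loop can be treated as a linear control system.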

The dynamics of phase-locked loops are rather complicated, but they are relatively easy to implement in practice. For small perturbations about the center frequency of the VCO, the control system is approximately linear and can be analyzed using standard frequency-domain design techniques [10]. Nonlinear analysis methods are required to analyze large deviations from the center frequency of the VCO, but, in general, complex analyses were not required for implementing phase-locked loops [11]. In practice, all of the information required for implementing the PLL was on the datasheet that came with the device [12].

A digital phase-locked loop was used in the adaptive filter circuit to simplify the microcontroller interface. Although the operating principles were essentially the same as described previously, the VCO output was a TTL logic signal that could be easily interpreted with the microprocessor for adapting the filter parameters. The center frequency and operating range of the phase-locked loop were set at 8.35 Hz and 5 Hz to 24 Hz, respectively, for all of the control experiments.

Figure 7: Block diagram of a phase-locked loop.

Digitally-Programmable  Analog  Filter 

The PPF compensator was implemented on a digitally-programmable analog (DPA) filter that could be reprogrammed via the digital I/O ports of the microcontroller. The DPA was a switched-capacitor filter whose switching frequency, $f_{clk}$, was set by a square-wave signal applied to its oscillator input [9]. The ratio of the filter frequency to the oscillator frequency was set via a 6-bit digital word that was downloaded serially from the microcontroller to the DPA filter. For all of the present analyses, the ratio $f_{clk}/f_f$ was set at 200 to minimize the phase lags that occur due to the holds at the filter output. The damping ratio of the filter was set via a 7-bit digital word also downloaded serially in conjunction with the frequency ratio settings.

The  use  of  a  DPA  filter  to  implement  the  PPF  filter  eliminated  two  of  the  problems  that  occurred  when 
using  a  purely  digital  implementation  of  the  compensator: 

1.  Problems  due  to  fixed-point  processing  were  nonexistent  due  to  the  fact  that  the  filter,  although 
discretized  due  to  the  switched-capacitor  implementation,  was  an  analog  device  that  did  not  use  a 
finite  difference  equation  to  compute  the  filter  output. 

2.  Phase  lags  due  to  sample  and  hold  were  negligible  due  to  the  fact  that  the  switching  frequency  was 
200  times  greater  than  the  cutoff  frequency  of  the  filter. 

The  discrete  nature  of  the  device  (due  to  the  switched-capacitor  implementation)  necessitated  the  use  of 
a  smoothing  filter  in  series  with  the  filter  output.  Without  the  smoothing  filter,  the  actuators  would  emit 
audible  noise  in  the  2  kHz  to  6  kHz  frequency  range  when  the  beam  was  moved.  For  this  reason,  a  second- 
order  Chebyshev  filter  with  a  break  frequency  of  approximately  1  kHz  was  placed  in  series  with  the  filter 
output.  The  smoothing  filter  eliminated  a  significant  portion  of  the  audible  noise  without  adversely  affecting 
the  performance  of  the  feedback  compensator. 

Microcontroller  with  Mechanical  Trigger 

Implementing  the  PPF  compensator  on  a  digitally-programmable  analog  filter  offloaded  a  significant  portion 
of  the  real-time  computation  from  the  microcontroller.  In  this  implementation,  the  microcontroller  acted 
primarily  as  a  timer/counter  that  used  the  output  of  the  phase-locked  loop  to  determine  the  parameters 
of the switched-capacitor filter. As described in the previous subsection, the center frequency of the DPA
filter, f_f, is set by inputting a clock signal with a frequency of 200 f_f. Therefore, the main function of
the microcontroller was to measure the vibration frequency and output a clock signal that had a period of
1/200th the period of natural vibration of the beam.

The  period  of  the  beam  vibration  was  determined  with  a  simple  algorithm  that  used  external  interrupts 
to  count  the  time  between  successive  rising  edges  of  the  phase-locked  loop  output.  The  output  of  the  phase- 
locked  loop  was  input  into  the  external  interrupt  pin  of  the  microcontroller.  On  the  first  low-to-high  transition 
the  microcontroller  started  incrementing  a  dummy  variable  that  increased  by  one  on  each  successive  rising 
edge. On the fifth rising edge the microcontroller began incrementing a counter every 1.6 µs using a 16-bit
timer with a prescaler of 8. On the next rising edge the timer variable was read and the period of the beam
vibration (as measured by the output of the phase-locked loop) was computed. The computed period was
then divided by 200 and the microcontroller began outputting a square wave with a period of 1/200th of the
period of the beam vibration.
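The period-measurement steps above can be sketched in a few lines of Python. This is only an illustrative model of the counting logic, not the PIC firmware used in the experiments: the function names and the list of edge timestamps are assumptions introduced here, while the 1.6 µs tick, the 16-bit timer, the five-edge delay, and the factor of 200 are taken from the text.

```python
TICK_S = 1.6e-6         # timer tick from the text (16-bit timer, prescaler of 8)
TIMER_MAX = 2**16 - 1   # largest count a 16-bit timer can hold
CLOCK_RATIO = 200       # DPA clock frequency / filter center frequency
SKIP_EDGES = 5          # timing starts on the fifth rising edge


def measure_period_ticks(edge_times_s):
    """Return the timer ticks counted between the 5th and 6th rising edges.

    edge_times_s is a hypothetical list of rising-edge timestamps (seconds)
    of the phase-locked loop output, standing in for the external interrupt.
    """
    if len(edge_times_s) < SKIP_EDGES + 1:
        raise ValueError("need at least six rising edges")
    start = edge_times_s[SKIP_EDGES - 1]   # fifth edge starts the timer
    stop = edge_times_s[SKIP_EDGES]        # next edge stops it
    ticks = int((stop - start) / TICK_S)
    if ticks > TIMER_MAX:
        raise OverflowError("period too long for a 16-bit timer at this tick")
    return ticks


def dpa_clock_period_s(ticks):
    """Period of the square wave sent to the DPA filter: 1/200th of the beam period."""
    return ticks * TICK_S / CLOCK_RATIO


if __name__ == "__main__":
    beam_hz = 24.1                                   # unloaded beam frequency from the text
    edges = [i / beam_hz for i in range(8)]          # idealized, jitter-free PLL output
    ticks = measure_period_ticks(edges)
    print(ticks, 1.0 / dpa_clock_period_s(ticks))    # tick count and DPA clock frequency (Hz)
```

With a 1.6 µs tick, a 16-bit count overflows after roughly 105 ms, so under these assumptions the sketch can only time vibration frequencies above about 9.5 Hz, which covers the 11.3 Hz to 24 Hz range of the beam.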

The  five-cycle  delay  was  incorporated  into  the  algorithm  to  improve  the  accuracy  of  the  measurement. 
Initial  tests  indicated  that  without  a  short  delay  the  algorithm  would  often  measure  inaccurate  values  of 
the vibration period. The errors were attributed to the fact that the first rising edge of the phase-locked
loop output did not always correspond to a full cycle of the vibration period, so the microcontroller
could not accurately calculate the period. Placing a five-cycle delay in the algorithm significantly
improved the accuracy of the period measurement, although it is probable that a one- or two-cycle delay
would have had the same beneficial effect.

Once  the  period  had  been  computed,  the  microcontroller  would  output  the  DPA  clock  frequency  until  it 
was  reset  by  the  mechanical  trigger.  The  mechanical  trigger  consisted  of  a  lowpass  filter  with  a  relatively 
low break frequency (approximately 2 Hz) and a Schmitt trigger that went from high (+5 V) to low
(0 V) when the lowpass filter output reached its maximum value. The lowpass filter served two purposes: it
filtered out the oscillating component of the charge amp output, which eliminated false resets when the beam
was vibrating, and it provided a controlled step response that reached its maximum value approximately 1
or 2 seconds after the beam was held back at the tip. Holding the beam back for approximately 1 or 2 seconds
reset the microcontroller and enabled the filter to adapt to the vibration frequency of the beam. This provided
a  simple  method  of  adapting  the  filter  parameters  without  a  significant  change  in  the  beam  demonstration. 

Bridge  Amplifier 

The piezoceramic actuators were driven with a bridge amplifier to increase their control authority. Because
the beam vibration was at a relatively low frequency (< 30 Hz), standard operational amplifiers
with approximately 20 mA of current output could be used to drive the actuators. The bridge amp was
implemented on an LM324 quad op-amp that was operated from a ±18 VDC supply; therefore, the bridge
amplifier was able to deliver a maximum of ±36 V to the actuators.


Digitally-Programmable  Analog  Implementation 
of  the  Adaptive  Filter 

The  effectiveness  of  the  digitally-programmable  adaptive  filter  was  demonstrated  in  a  series  of  experiments 
on  a  flexible  composite  beam.  The  experimental  setup,  as  shown  in  Figure  8,  consisted  of  a  0.256  m  long 
flexible clamped-free beam constructed from 6 plies of graphite epoxy. The width of the beam was 4.7 cm
and the thickness was 1.52 mm. Piezoceramic actuators were bonded on both sides of the beam near the
fixed end and a single piezoceramic sensor was bonded at approximately 1/3 the length of the beam. The
piezoceramic sensor was a single 0.254 cm layer while each piezoceramic actuator consisted of two 0.254 cm
thick layers bonded on top of one another. Two layers were bonded on top of one another to increase the
control authority of the structural actuators. A clip was placed at the free end of the beam so that mass
could be easily added to the beam.

Figure 8: Schematic of the experiment demonstrating the digitally-programmable adaptive filter.

The  control  circuit  was  connected  by  hooking  up  the  output  of  the  piezoceramic  sensor  to  the  input  of 
the  charge  amplifier  and  connecting  the  output  of  the  bridge  amplifier  to  the  piezoceramic  actuators  (see 
Figure 8). The circuit itself was powered by a set of six 9 V batteries; four of the batteries provided the
±18 VDC for the bridge amplifier and the remaining two batteries provided the ±5 VDC for the digital
components of the circuit.

The  performance  of  the  beam  was  measured  by  exciting  the  structure  with  an  impact  hammer  and 
measuring  the  output  of  the  surface  bonded  piezoceramic  sensor.  The  input-output  transfer  function  was 
measured  with  a  spectrum  analyzer  over  the  frequency  range  0  to  200  Hz.  Closed-loop  damping  ratios  were 
measured  by  curvefitting  the  measured  frequency  response  and  extracting  the  structural  parameters  from 
the  best  fit  of  the  data. 

The  experiments  were  performed  in  two  stages.  First,  an  off-line  identification  procedure  was  used  to 
determine  the  gain  of  the  feedback  filter.  An  appropriate  choice  for  the  filter  gain  was  obtained  using  the 
measured  frequency  response  between  the  piezoceramic  actuators  and  the  piezoceramic  sensor.  The  DC 
asymptote  of  the  frequency  response  was  measured  from  the  experimental  data,  and  a  filter  gain/DC  gain 
ratio  of  0.4  was  chosen  so  that  approximately  20%  critical  damping  would  be  obtained.  This  gain  was  then 
fixed  throughout  the  remainder  of  the  control  experiments. 

The  performance  of  the  adaptive  filter  was  examined  with  three  sets  of  experiments.  In  the  first  set  of 
experiments,  the  filter  was  adapted  to  the  natural  frequency  of  the  beam  without  any  added  tip  mass  and 
then  additional  mass  was  placed  at  the  free  end.  For  each  value  of  the  added  mass,  a  frequency  response 
between  the  impact  hammer  and  the  piezoceramic  sensor  output  was  obtained  to  determine  the  performance 
of  the  nonadaptive  filter.  Five  values  of  mass  ranging  from  8.21  g  to  38.1  g  were  added  to  the  beam,  which 
changed  the  natural  frequency  of  the  beam  from  24.1  Hz  (without  any  added  mass)  to  approximately  11.3 
Hz  -  over  a  factor  of  two  variation  in  the  frequency. 
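The factor-of-two frequency shift can be related to the added mass with a simple lumped model. The sketch below assumes the first mode behaves like a single mass on a spring, so that the natural frequency scales as one over the square root of the total effective mass; this model, the function names, and the resulting effective mass are illustrative assumptions and are not taken from the report. Only the two end-point frequencies and the 38.1 g mass come from the text.

```python
import math

F_BARE_HZ = 24.1      # measured natural frequency with no added mass (from the text)
F_LOADED_HZ = 11.3    # measured natural frequency with the largest mass (from the text)
M_MAX_G = 38.1        # largest added tip mass in grams (from the text)


def effective_mass_g(f_bare=F_BARE_HZ, f_loaded=F_LOADED_HZ, m_added=M_MAX_G):
    """Effective modal mass implied by the two measured end points.

    Assumes f is proportional to 1/sqrt(m_eff + m_added), i.e. a
    single-degree-of-freedom model with constant stiffness.
    """
    ratio_sq = (f_bare / f_loaded) ** 2
    return m_added / (ratio_sq - 1.0)


def predicted_frequency_hz(m_added_g, f_bare=F_BARE_HZ, m_eff_g=None):
    """Model estimate of the first natural frequency for a given tip mass."""
    if m_eff_g is None:
        m_eff_g = effective_mass_g()
    return f_bare * math.sqrt(m_eff_g / (m_eff_g + m_added_g))


if __name__ == "__main__":
    print(effective_mass_g())              # implied effective mass, grams
    print(predicted_frequency_hz(8.21))    # model estimate for the smallest added mass
```

By construction the model reproduces the two measured end points; values for intermediate masses are estimates only and would need to be checked against the measured frequency responses.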

The  second  set  of  experiments  was  performed  by  adapting  the  filter  to  the  11.3  Hz  natural  frequency 
and  removing  mass  from  the  free  end.  As  with  the  previous  experiment,  the  frequency  response  between 
the  impact  hammer  and  the  piezoceramic  sensor  was  measured  for  each  value  of  removed  mass.  This  set  of 
experiments  was  performed  to  analyze  the  performance  of  the  nonadaptive  filter  that  has  been  designed  to 
add  damping  to  the  11.3  Hz  mode  of  the  beam. 

Figure 9: Measured damping ratios from the nonadaptive and adaptive filter tests: filter adapted to the 24 Hz
mode (left), and filter adapted to the 11.3 Hz mode (right).

The  final  set  of  experiments  analyzed  the  effectiveness  of  the  adaptive  filter  that  tracked  the  frequency 
of  the  beam.  Starting  with  no  additional  mass  on  the  free  end  of  the  beam,  the  filter  was  adapted  to 
the  first  natural  frequency  of  the  beam  and  the  performance  transfer  function  was  measured.  Next,  the 
smallest  weight  was  added  to  the  free  end  and  the  filter  was  adapted  by  mechanically  triggering  a  reset  in 
the  microprocessor.  A  second  frequency  response  was  obtained  of  the  beam  with  the  adapted  filter.  This 
procedure was repeated for the remaining four values of added mass, and the resulting set of data was
analyzed  to  determine  the  achievable  damping  ratio  for  a  filter  that  tracks  the  first  natural  frequency  of  the 
vibrating  beam. 

The complete set of eighteen measurements, along with a set of six open-loop measurements, was analyzed
to determine the closed-loop structural damping for each value of the added mass. The results are
plotted in Figure 9. The plot on the left illustrates the closed-loop damping ratios achieved for the
nonadaptive filter tuned to the 24 Hz mode and the closed-loop damping ratios of the adaptive filter. As expected,
the damping ratios achieved with the nonadaptive filter drop sharply as the natural frequency of the beam
decreases. Approximately a factor of three reduction (from 17% critical to roughly 6% critical) in the
closed-loop damping ratio occurs as the beam frequency changes from 24 Hz to 11.3 Hz. In contrast, the adaptive
filter maintains between 14% critical and 20% critical damping as the beam frequency drops. For the lowest
frequency mode, 11.3 Hz, the adaptive filter has approximately three times as much closed-loop damping as the
nonadaptive filter.

Similar  trends  occur  when  the  filter  is  adapted  to  the  lowest  frequency  structural  mode.  The  plot  on  the 
right  in  Figure  9  demonstrates  that  the  drop  in  performance  is  even  more  pronounced  when  the  structural 
mode  increases  in  frequency  and  the  filter  parameters  are  fixed.  In  this  case,  the  closed-loop  structural 
damping  reduces  from  20%  critical  to  less  than  5%  critical  as  the  beam  frequency  increases  from  11.3  Hz  to 
24  Hz.  As  with  the  previous  case,  the  adaptive  filter  maintains  a  more  constant  level  of  performance  over 
the  range  of  added  mass. 

Figure 10 is a more detailed illustration of the effects of filter adaptation. The open-loop frequency
response between the impact hammer input and the piezoceramic sensor output is compared to the frequency
response after the filter has adapted to the 24 Hz mode (curves 1 and 2). Comparing these curves, we see that
adapting the filter to the 24 Hz natural frequency reduces the peak response by approximately a factor of 10
and increases the damping ratio from approximately 1% to 17%. Adding the maximum tip mass lowers the resonance
frequency of the mode to 11.3 Hz and reduces the performance of the fixed-parameter filter (curve 3). Adapting
the filter to the 11.3 Hz mode recovers the optimal performance and increases the structural damping from
approximately 5% to slightly greater than 20% (curve 4).

Figure 10: Frequency responses for no control, no added mass (1); 24 Hz filter, no added mass (2); 24 Hz
filter, 38.1 g added mass and an 11.3 Hz structural mode (3); and 11 Hz filter and 38.1 g added mass (4).

Filter  Adaptation 

As previously described, the frequency adaptation algorithm was restarted with a mechanical trigger
connected to the sensor output of the beam. This adaptation occurred over approximately a 500 µs interval
after the microcontroller had been reset.

Although the frequency adaptation algorithm was a reliable method of determining the vibration
frequency of the beam, there were instances in which the algorithm identified an incorrect natural frequency.
These anomalies were attributed to the timing of the beam vibration with respect to the start of the
identification algorithm. The algorithm was sensitive to incorrect timing and would identify the wrong vibration
frequency approximately 5 to 10% of the time.

Making the algorithm more reliable would involve a more detailed analysis of the timing of the
mechanical trigger and the frequency identification algorithm. It is probable that a more reliable triggering
system, one less dependent on the user vibrating the beam in the right manner, could be coded into the
microcontroller. This would involve examining the output of the phase-locked loop and determining whether the
microcontroller is counting the correct cycle of the PLL output.


Conclusions 

PIC microcontrollers are viable processors for implementing active damping control laws, but certain
limitations exist due to the current processor architecture. The speed of the processor limits digital filter
sampling rates to roughly 5 kHz/N, where N is the number of second-order filter sections. Optimizing the current
code and using a 33 MHz processor (as opposed to the 20 MHz processor used for these studies) would
increase the sampling rate to approximately 9 kHz/N. Assuming the maximum achievable filter frequency
to be 10 times less than the sampling rate, these limitations restrict the highest filter frequency to between
500 Hz and 900 Hz per second-order section. Fixed-point processing limits the lowest frequency achievable
with the PIC microcontroller. For 8-bit arithmetic this limitation is approximately 0.03 times the sampling
rate, whereas for 16-bit arithmetic this limitation is approximately 0.002 times the sampling rate. Thus,
sampling rate limitations and the use of fixed-point arithmetic place an upper and a lower bound on the
frequency of the feedback compensators.
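The bounds stated above can be collected into a short calculation. The following sketch is an illustration only: the function name and its parameters are assumptions introduced here, and it simply applies the rules of thumb quoted in this paragraph (sampling rate of roughly 5 kHz/N or 9 kHz/N, upper limit at one tenth of the sampling rate, lower limit at 0.03 or 0.002 times the sampling rate).

```python
# Lower-bound fractions of the sampling rate quoted in the text.
LOW_FRACTION = {8: 0.03, 16: 0.002}


def compensator_band_hz(n_sections, rate_per_section_hz=5000.0, word_bits=16):
    """Return (lowest, highest) usable filter frequency in Hz.

    n_sections          -- number of second-order filter sections, N
    rate_per_section_hz -- roughly 5000 for the 20 MHz processor,
                           roughly 9000 for optimized code at 33 MHz
    word_bits           -- 8 or 16, the fixed-point word length
    """
    if n_sections < 1:
        raise ValueError("at least one second-order section is required")
    sampling_rate_hz = rate_per_section_hz / n_sections
    highest_hz = sampling_rate_hz / 10.0              # filter frequency 10x below fs
    lowest_hz = LOW_FRACTION[word_bits] * sampling_rate_hz
    return lowest_hz, highest_hz


if __name__ == "__main__":
    for bits in (8, 16):
        print(bits, compensator_band_hz(1, 5000.0, bits))
```

Under these rules of thumb a single section with 16-bit arithmetic has a lower bound close to the 11.3 Hz mode of the loaded beam, which is consistent with the motivation for moving the compensator onto the analog filter.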

More  design  freedom  is  achieved  by  offloading  the  filter  implementation  to  a  digitally-programmable 
analog  filter  and  using  the  PIC  microcontroller  in  a  ‘supervisory’  role.  This  ‘hybrid’  approach  eliminates  the 
problems  due  to  sampling  rate  and  fixed-point  arithmetic  by  reducing  the  number  of  real-time  computations 
that  must  be  performed  by  the  microcontroller.  A  circuit  consisting  of  a  digitally-programmable  analog  filter, 
a  PIC16C65A  microcontroller,  and  a  digital  phase-locked  loop  was  designed  and  tested  in  a  series  of  control 
experiments.  The  feedback  compensator  was  implemented  on  the  digitally-programmable  analog  filter,  and 
a  frequency-tracking  algorithm  was  implemented  using  the  phase-locked  loop  and  the  PIC  microcontroller. 
The  structure  was  a  flexible  clamped-free  beam  with  a  variable  tip  mass  that  changed  the  natural  frequency 
from  24  Hz  to  approximately  11  Hz.  A  series  of  tests  demonstrated  that  the  closed-loop  damping  achieved 
with a fixed-parameter filter would be reduced by approximately a factor of three as the frequency varied
from 24 Hz to 11 Hz. A second set of control experiments demonstrated that the adaptive filter could
reliably track the natural frequency of the beam in approximately 500 µs and maintain between 14% and
20% damping in the first structural mode.

One  possible  extension  of  this  work  would  focus  on  the  design  of  an  adaptive  filter  that  suppresses  forced 
vibrations in the beam. The current algorithm is essentially a frequency-tracking approach that assumes
that the gain and damping of the filter have been predetermined through an off-line identification. A more
general algorithm would also include on-line identification of the optimal control gain and filter damping
ratio. It is likely that this algorithm could be implemented using the current circuit design because
there is sufficient memory available in the microcontroller.

Abstract

Near  infrared  radiation  for  imaging  tumors  in  tissue  has  been  recently  explored  and  investigated. 
In  particular,  to  validate  a  novel  FFT  imaging  reconstruction  algorithm,  we  have  performed  experiments 
on  tissue  phantoms  using  a  frequency-domain  photon  migration  system.  The  system  includes  an 
amplitude-modulated  laser  diode,  two  radio  frequency  (RF)  signal  generators,  an  avalanche  photo 
detector,  and  a  lock-in  amplifier.  The  tissue  phantoms  were  made  with  inhomogeneities  imbedded  and 
were  then  scanned  with  the  system  at  various  RF  frequencies.  The  data  from  the  amplitude  and  phase 
measurements were analyzed using the FFT reconstruction algorithm. The reconstructed data clearly
validate the FFT algorithm and make it possible to localize inhomogeneities hidden in turbid media in three
dimensions. In addition, based on the results, we present a preliminary analysis of the optimization of
experimental parameters to obtain good-quality reconstructed images with the best experimental efficiency.




EXPERIMENTAL  VALIDATION  OF  THREE-DIMENSIONAL  RECONSTRUCTION 
OF  INHOMOGENEITY  IMAGES  IN  TURBID  MEDIA 


Hanli  Liu 

1. Introduction

In the last few years, numerous research efforts and technology developments have been made
towards using near infrared (NIR) radiation for human tissue quantification and imaging. Three forms of
light used in the field include short-pulsed light in the time domain (TD), intensity-modulated light in the
frequency domain (FD), and continuous-wave (CW) light. Among them, the frequency-domain system
has advantages over the other two: an FD system can acquire data almost in real time, on the order of
milliseconds, much faster than a TD system, which utilizes a single photon counting method and requires a much
longer data acquisition time (multiple seconds). In addition, with the help of diffusion theory, the FD system
allows us to extract the optical properties of tissues/organs under study and to image inhomogeneities
hidden in tissues/organs. In comparison, the CW method lacks quantification unless close-distanced
(2-5 mm) measurements are performed.

The light source used for an FD system is intensity modulated at a radio frequency (RF) ranging
from several MHz up to GHz, at wavelengths in the range from 650 nm to 900 nm. In principle, NIR light
illuminating tissue becomes strongly scattered after travelling about 1 mm in the tissue. In the FD
case,  the  intensity-modulated  light  travelling  in  tissue  will  form  diffuse  photon  density  waves  (DPDW’s), 
and  the  amplitude  and  phase  of  DPDW’s  carry  information  on  the  optical  properties  of  the  tissue.  By 
extracting  amplitude,  phase,  or  optical  properties  of  thick  tissue,  we  can  reconstruct  tissue  images  that 
show  inhomogeneities  imbedded  inside  the  thick  tissue.  This  provides  a  new  methodology  for  tumor 
imaging  in  medical  applications. 

Over the last few years, the common geometries used for imaging studies have been source(s)-detector pairs
scanning 1) horizontally or 2) circularly [1]. In both cases, mechanical movement is required, and only
two-dimensional images are provided. Recently, Matson et al. proposed a new approach to three-dimensional
tumor localization in turbid media with the use of FD measurements in a single plane [2]. The theoretical
basis of this novel reconstruction algorithm stems from Fourier Optics: applying the Fast Fourier Transform
(FFT) to the diffusion equation that DPDW's obey, and then extending the solution of Fourier Optics to
DPDW's. This FFT reconstruction algorithm has several advantages: 1) the light source does not
need to move mechanically, and 2) it has the potential to avoid mechanical movement, which has been shown to
produce significant errors in image reconstruction. The theory has yet to be verified by experiments,
although it was supported by computer simulations at one RF frequency of 1 GHz with selected scanning
parameters.

The goal of this summer research project is to experimentally validate and optimize this three-dimensional
reconstruction algorithm to image inhomogeneities in turbid media. In this report, we will
first introduce the theoretical foundation for the FFT reconstruction algorithm, followed by a brief
description of the experimental setup, materials, and methods. Then we will show experimental results and
reconstructed images, and investigate the correlation among experimental parameters that affect the
quality of the reconstructed images. Furthermore, computer simulations are performed to confirm the
finding that a larger scanning area is more crucial than smaller scanning pixels for obtaining a higher
resolution image. Finally, we will draw conclusions from this study and indicate future work.

2.  Theoretical  Background 

Based on Fourier Optics [3], we define the following relations:

$$A_z(f_x, f_y) = \iint U(x, y, z)\, e^{-i 2\pi (f_x x + f_y y)}\, dx\, dy, \tag{1}$$

$$U(x, y, z) = \iint A_z(f_x, f_y)\, e^{i 2\pi (f_x x + f_y y)}\, df_x\, df_y, \tag{2}$$

where U(x, y, z) is the complex field of a monochromatic wave across the plane at z, and $A_z$ is the
two-dimensional Fourier transform of the function U(x, y, z). It can be shown that a free-space solution to the
source-free Helmholtz equation exists in the form of a spatial frequency spectrum across a plane parallel to
the xy plane but at a distance z from it [3].

In addition, it has been shown [4] that the diffusion equation adequately represents photon
propagation in tissue, and the source-free diffusion equation in FD can be written as

$$\nabla^2 U(x, y, z) + k^2 U(x, y, z) = 0, \tag{3}$$

where U(x, y, z) is the spatial distribution of a DPDW in a scattering medium,
$k^2 = (-v\mu_a + i\omega)/D = 3\mu_s'(-\mu_a + i\omega/v)$, v is the speed of light travelling in tissue,
$\omega$ is the modulation frequency of the light multiplied by $2\pi$, and $\mu_a$ and $\mu_s'$ are the
absorption and reduced scattering coefficients of the scattering medium, respectively. The second form of
$k^2$ follows from the diffusion coefficient $D = v/(3\mu_s')$.
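As a numerical illustration of the complex wave number defined above, the short sketch below evaluates both forms of k^2 and can be used to check that they agree. The function names, the refractive index of 1.4 used to obtain the speed of light in the medium, and the sample optical properties are assumptions chosen for illustration; the sample coefficients lie inside the ranges quoted later for the phantoms, but they are not measured values from this report.

```python
import math

C_VACUUM_CM_PER_S = 2.998e10   # speed of light in vacuum, cm/s


def k_squared_from_d(mu_a, mu_s_prime, mod_freq_hz, n_refr=1.4):
    """k^2 = (-v*mu_a + i*omega)/D with D = v/(3*mu_s'), in cm^-2.

    n_refr is an assumed refractive index of the medium (hypothetical value).
    """
    v = C_VACUUM_CM_PER_S / n_refr
    omega = 2.0 * math.pi * mod_freq_hz
    d_coeff = v / (3.0 * mu_s_prime)
    return (-v * mu_a + 1j * omega) / d_coeff


def k_squared_direct(mu_a, mu_s_prime, mod_freq_hz, n_refr=1.4):
    """Equivalent form k^2 = 3*mu_s'*(-mu_a + i*omega/v), in cm^-2."""
    v = C_VACUUM_CM_PER_S / n_refr
    omega = 2.0 * math.pi * mod_freq_hz
    return 3.0 * mu_s_prime * (-mu_a + 1j * omega / v)


if __name__ == "__main__":
    # Illustrative inputs: mu_a = 0.05 cm^-1, mu_s' = 10 cm^-1, 100 MHz modulation.
    print(k_squared_from_d(0.05, 10.0, 100e6))
    print(k_squared_direct(0.05, 10.0, 100e6))
```

The real part of k^2 is set by absorption and scattering alone, while the imaginary part grows with the modulation frequency, which is why the amplitude and phase of a DPDW both carry information about the medium.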

Now  inserting  the  inverse  Fourier  transform  definition,  eq.  (2),  into  the  diffusion  equation,  eq. 
(3),  results  in: 


$$\frac{d^2}{dz^2} A_z(f_x, f_y) + \left[k^2 - (2\pi f_x)^2 - (2\pi f_y)^2\right] A_z(f_x, f_y) = 0. \tag{4}$$


In  analogy  to  the  solution  obtained  in  Fourier  Optics  for  regular  plane  waves,  we  can  find  a  solution  to  eq. 
(4)  using  the  Fourier  transform  definition: 


$$A_{z_1}(f_x, f_y) = A_{z_0}(f_x, f_y)\, \exp\!\left[i\,\Delta z \sqrt{k^2 - (2\pi f_x)^2 - (2\pi f_y)^2}\right], \tag{5}$$
where $\Delta z = z_0 - z_1$ is the distance between the detection plane at $z_0$ and the reconstruction plane
at $z_1$. Furthermore, we can define a transfer function, $H_{\Delta z}(f_x, f_y)$, describing how a DPDW
propagates in a homogeneous medium,

$$H_{\Delta z}(f_x, f_y) = \exp\!\left[i\,\Delta z \sqrt{\frac{-v\mu_a + i\omega}{D} - (2\pi f_x)^2 - (2\pi f_y)^2}\right]. \tag{6}$$


It  is  worthwhile  to  notice  several  important  points: 

1)  in  using  diffusion  theory,  we  neglect  boundary  effects  and  assume  the  medium  to  be  homogeneous 
with  spherical  inhomogeneities; 

2)  in  applying  Fourier  Optics  to  obtain  equation  (5),  we  extend  the  wave  vector,  k,  from  a  real  number  to 
a  complex  number  to  include  scattering  effects,  as  the  diffusion  equation  does; 

3)  eq.  (6)  describes  the  change  in  the  Fourier  transform  of  the  photon  density  wave  as  it  propagates,  and 
thus  eq.  (6)  can  be  used  to  invert  the  propagation  and  to  reconstruct  the  photon  density  wave  behind 
the  detection  plane. 
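Point 3 can be illustrated numerically for a single spatial-frequency component. The sketch below evaluates the transfer function of eq. (6) and shows that applying it with the opposite sign of the plane separation undoes the propagation. The function name, the choice of the principal branch of the complex square root, and the sample value of k^2 are assumptions made for illustration; a full reconstruction would apply the same factor to every component of a two-dimensional FFT of the measured data.

```python
import cmath
import math


def transfer_function(fx, fy, dz, k2):
    """Evaluate H_dz(fx, fy) = exp(i*dz*sqrt(k2 - (2*pi*fx)^2 - (2*pi*fy)^2)).

    fx, fy -- spatial frequencies in cycles/cm
    dz     -- plane separation in cm (sign selects the propagation direction)
    k2     -- complex k^2 of the medium in cm^-2
    The principal branch of the square root is used (an assumption).
    """
    kz = cmath.sqrt(k2 - (2.0 * math.pi * fx) ** 2 - (2.0 * math.pi * fy) ** 2)
    return cmath.exp(1j * dz * kz)


if __name__ == "__main__":
    k2 = complex(-1.5, 0.88)      # illustrative value only, cm^-2
    h_forward = transfer_function(0.5, 0.0, 2.0, k2)
    h_inverse = transfer_function(0.5, 0.0, -2.0, k2)
    print(abs(h_forward), abs(h_inverse))   # attenuation and its inverse
    print(h_forward * h_inverse)            # close to 1: the propagation is undone
```

Because the forward factor attenuates high spatial frequencies strongly, the inverse factor amplifies them by the same amount, so in practice the measurement noise at those frequencies is amplified as well.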

3.  Necessity  of  Experiments 

The primary goal of this summer research project is to experimentally validate the FFT reconstruction
algorithm introduced above. Performing experiments is necessary because

1)  experiments  permit  studying  boundary  effects  on  the  reconstructed  images  by  using  finite  sizes  of 
tissue  phantoms; 

2)  we  can  study  instrument  detectability  and  sensitivity  in  locating  hidden  objects  with  different 
sizes  and  optical  properties; 

3)  we  can  optimize  scan  parameters  by  performing  experiments  with  various  scan  dimensions,  pixel 
sizes,  and  modulation  frequencies; 

4)  measurements on homogeneous samples establish the feasibility of obtaining correct background
information for the reconstruction and demonstrate the importance of such information.

4.  Experimental  Setup 

As shown in Figure 1, the electronic components of our FD system include two RF signal generators
(Rohde & Schwarz, SMY01) ranging from 9 kHz to 1.04 GHz, two 50% power splitters (Mini-Circuits,
ZFSC-2-1), two RF amplifiers (Mini-Circuits, ZFL-1000H), a NIR laser diode (LaserMax Inc, LSX-3500)
modulatable up to 100 MHz at 780 nm and 680 nm with laser powers of 3 mW (at 116 mA) and 9
mW, an Avalanche Photo Diode (APD) (Hamamatsu, C5331), two frequency mixers (Mini-Circuits,
ZLW-2), two low pass filters with the corner frequency at 1 MHz (Mini-Circuits, SLP-1.9), a lock-in
amplifier (Stanford Research Systems, SR850), and a computer-controlled, two-dimensional positioning
stage (Aerotech, Inc. ATS 100).

Figure 1. Experimental setup of an FD system


The frequencies of the two signal generators were chosen from 10 to 100 MHz with an offset
Δf = 25 kHz for the reference signal with respect to the measurement signal in order to achieve a
heterodyne system. Both RF signals, the reference at f+Δf and the measurement at f, were 50%
power split, and one branch from each of the two RF signal generators was mixed by Mixer 1 to generate a
reference signal for the lock-in amplifier. The output levels of the two signal generators were set with a
ratio of 7 to 1 between the reference and measurement signals to optimize the mixing efficiency. The
second branch of the measurement signal at f was amplified to gain enough power to modulate the
intensity of the NIR laser diode at 780 nm. The intensity-modulated, manufacturer-collimated laser
light illuminated a tissue phantom and was then detected by a movable APD, which converted the optical signal
to an electrical signal. A second amplifier amplified the output of the APD before the electrical signal was
mixed with the second branch of the reference signal at f+Δf using Mixer 2. The output frequencies of
Mixer 1 and Mixer 2 consisted of 2f+Δf, f+Δf, f, Δf, and higher harmonic components. By utilizing two
low pass filters with a corner frequency of 1 MHz, only the signal with a frequency of Δf = 25 kHz passed
to the lock-in amplifier. The corresponding readings for amplitude and phase were recorded
automatically by a computer; in turn, feedback from the computer controlled the sensitivity and time
constant settings for the lock-in amplifier based on the readings. In addition, through a GPIB interface,
the computer set the RF frequencies for the two signal generators and controlled an XY positioning stage,
which held the APD for 2-dimensional imaging scans.
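The frequency bookkeeping of the heterodyne scheme can be checked with a few lines of code. The sketch below lists the mixer output components named in the text and keeps those below the filter corner; the function names are introduced here for illustration, higher harmonics are ignored, and the low pass filter is idealized as a sharp cutoff, which a real filter is not.

```python
def mixer_output_frequencies(f_meas_hz, delta_f_hz):
    """Components named in the text for a mixer fed with f and f + delta_f.

    Returns the difference, both inputs, and the sum, in ascending order.
    Higher harmonics are left out of this simplified list.
    """
    f_ref_hz = f_meas_hz + delta_f_hz
    components = {
        f_ref_hz - f_meas_hz,   # delta_f, the heterodyne beat
        f_meas_hz,              # f
        f_ref_hz,               # f + delta_f
        f_ref_hz + f_meas_hz,   # 2f + delta_f
    }
    return sorted(components)


def passed_by_low_pass(freqs_hz, corner_hz=1.0e6):
    """Idealized low pass filter: keep only components at or below the corner."""
    return [f for f in freqs_hz if f <= corner_hz]


if __name__ == "__main__":
    for f_meas in (10e6, 50e6, 100e6):
        outputs = mixer_output_frequencies(f_meas, 25e3)
        print(f_meas, passed_by_low_pass(outputs))
```

Over the whole 10 MHz to 100 MHz range only the 25 kHz beat falls below the 1 MHz corner, so the lock-in amplifier always operates at the same frequency regardless of the modulation frequency chosen.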

5.  Materials  and  Methods 

Tissue phantoms were made out of plastic resin that hardens in molds when a catalyst is added. The resin is
combined with TiO2 powder for scattering property and sometimes with NIR dye for absorption
property [5]. The optical properties of homogeneous phantoms were determined using the slope algorithm [6]
in transmission geometry. Since the reconstruction algorithm requires information on the homogeneous
background medium that contains an object or objects, multiple samples were made as a set at the same time to
ensure that the samples of the set have the same background optical properties. In general, a set of phantoms
included a homogeneous sample and two others with different objects implanted inside. The phantom
sizes were typically 14 cm x 14 cm x 5 cm, and the absorption and scattering coefficients of the background
media were within a range of 0.01 to 0.1 cm^-1 and 3 to 17 cm^-1, respectively. The objects used as
inhomogeneities were plastic black, white, and clear beads with various diameters and shapes, as shown
and listed in Figure 2.


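[Figure 2 labels: LB (large black), 13 mm; SW (small white), 4 mm; SB (small black), 9 mm; LW (large white),
8 mm; MW (medium white), 6 mm; LC (large clear); MC (medium clear); CB (cylindrical black), 19 mm x 6 mm.
Two further size labels, 12 mm and 6 mm, appear in the figure next to the clear beads.]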


Figure  2.  Beads  used  as  tumor  objects  imbedded  in  tissue  phantoms 

We use an XYZ coordinate system, shown in Figure 3, in this report to refer to the positions of
hidden objects in the phantoms. The XY plane is parallel to the detection plane, the origin is at the
center of the detection plane, and the Z axis points from the detection plane into the phantom.
Normally, a bead was placed at the center of a phantom, at coordinates (0, 0, d/2 cm), where d is the
thickness of the phantom. For multiple hidden tumors, two beads (of the same or different kinds) were
placed at (0, -2 cm, d/2 cm) and (0, 2 cm, d/2 cm) inside a phantom. In making inhomogeneous
phantoms, we positioned the object(s) on top of a semi-dried layer of background material that had been cast
earlier, and then added an equal amount of ready-to-cast resin on top of the first part to complete the
phantom. In this way, we implanted the objects inside the phantoms with good 3-dimensional coordinate
references, and X-ray measurements confirmed the positions of the imbedded objects.


Figure 3. The XYZ coordinate system used to label locations of hidden objects inside a phantom.


All of the scans used for this report were performed on areas of 8 cm x 8 cm having 64x64,
32x32, or 16x16 pixels with pixel spacings of 0.125 cm, 0.25 cm, and 0.5 cm, respectively. The
modulation frequencies were scanned from 10 MHz to 100 MHz in various steps in the actual experiments,
but the data shown in this report were taken mainly at 20 or 40 MHz. The average measurement time
was about 2.5 hours for a 32x32 scan with two modulation frequencies (20 and 40 MHz). The
experiments were performed in transmission geometry with the light source fixed at one side of the
sample and the movable detector located on the other side of the sample. The software package
Interactive Data Language (IDL) was used to process the measured data and to run the reconstruction
algorithm. In addition, the PMI simulation code [7] was used to predict and confirm the experimental data.

Because the transfer function contains an exponential decay term, the reconstructed DPDWs are
not stable and contain ringing patterns. This artifact is still under investigation.
At the present stage, to stabilize the reconstructed data and eliminate the ringing patterns, we applied a
low-pass (pillbox) filter to cut off high spatial frequency components. Specifically, the cutoff frequency
was limited to only a few pixels, typically 1 to 4, in the spatial frequency domain. We multiplied the
Fourier spectra by this cutoff filter before inverse Fourier transforming them to
reconstruct the DPDWs. For convenience, in the rest of the report, "filter" is used as shorthand
for the cutoff spatial frequency expressed as a pixel number in the Fourier domain.

After the amplitude, A, and phase, φ, were measured as a function of position in an XY plane at
z = z0, we followed the procedure given below to reconstruct three-dimensional images:

1) based on A and φ, calculate the complex photon density waves U_h(x, y, z0) and U_inh(x, y, z0)
for the homogeneous sample and the sample containing objects, respectively;

2) subtract the homogeneous wave from the inhomogeneous wave, U(x, y, z0) = U_inh(x, y, z0) -
U_h(x, y, z0);

3) FFT U(x, y, z0) to obtain its Fourier transform A(fx, fy, z0);

4) multiply A(fx, fy, z0) by the transfer function to execute the reconstruction process at z_i;

5) multiply A(fx, fy, z_i) by a low-pass filter with a selected filter number (cutoff pixels);

6) inverse FFT A(fx, fy, z_i) to the spatial domain for the reconstructed image at z_i;

7) repeat steps 4) to 6) for different values of z_i to back propagate the reconstruction;

8) determine the x, y coordinates of the hidden object at the detection plane, and then plot
amplitude versus z to localize the tumor in Z direction.
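The eight steps above can be sketched in Python with NumPy (the report's own processing was done in IDL, so this is an illustration rather than the authors' code). The sketch assumes a square scan, assumes that the transfer function has the exponential form given as Eq. (7) in the Data Analysis section, and interprets the pillbox "filter number" as a radius in Fourier-domain pixels. All function names and the synthetic input fields are hypothetical.

```python
import numpy as np

def transfer_function(n_pixels, spacing_cm, dz_cm, mu_a, mu_s_prime, omega_over_v):
    """Assumed Eq. (7) form: exp(-dz * sqrt(3 mu_a mu_s' + (2 pi fx)^2 + (2 pi fy)^2 - i 3 mu_s' w/v))."""
    f = np.fft.fftfreq(n_pixels, d=spacing_cm)          # spatial frequencies, cycles/cm
    fx, fy = np.meshgrid(f, f, indexing="ij")
    arg = (3.0 * mu_a * mu_s_prime + (2 * np.pi * fx) ** 2 + (2 * np.pi * fy) ** 2
           - 1j * 3.0 * mu_s_prime * omega_over_v)
    return np.exp(-dz_cm * np.sqrt(arg))

def pillbox(n_pixels, cutoff):
    """Low-pass mask keeping Fourier pixels within `cutoff` of zero frequency."""
    idx = np.fft.fftfreq(n_pixels) * n_pixels           # integer frequency indices
    ix, iy = np.meshgrid(idx, idx, indexing="ij")
    return (np.hypot(ix, iy) <= cutoff).astype(float)

def reconstruct(amp_h, phase_h, amp_inh, phase_inh, spacing_cm, depths_cm,
                cutoff, mu_a, mu_s_prime, omega_over_v):
    u_h = amp_h * np.exp(1j * phase_h)                  # step 1
    u_inh = amp_inh * np.exp(1j * phase_inh)
    u = u_inh - u_h                                     # step 2
    spectrum0 = np.fft.fft2(u)                          # step 3
    n = u.shape[0]
    mask = pillbox(n, cutoff)
    planes = []
    for dz in depths_cm:                                # step 7
        h = transfer_function(n, spacing_cm, dz, mu_a, mu_s_prime, omega_over_v)
        planes.append(np.fft.ifft2(spectrum0 * h * mask))   # steps 4 to 6
    return np.stack(planes)

# Synthetic example fields (made up for illustration, not measured data).
n, spacing = 32, 0.25
amp_h = np.ones((n, n))
phase_h = np.zeros((n, n))
amp_inh = amp_h.copy()
amp_inh[12:16, 14:18] *= 0.8
volume = reconstruct(amp_h, phase_h, amp_inh, phase_h, spacing,
                     depths_cm=[0.0, 1.0, 2.0], cutoff=2,
                     mu_a=0.1, mu_s_prime=10.0, omega_over_v=0.006)
z_profile = np.abs(volume[:, 14, 16])                   # step 8: amplitude versus z
```

Here `omega_over_v` is the modulation angular frequency divided by the speed of light in the medium, in cm^-1 (about 0.006 cm^-1 at 20 MHz according to the Data Analysis section). Computing the FFT once and reusing it for every depth keeps step 7 cheap.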

6.  Experimental  Results  and  Image  Reconstructions 

Figure 4 shows an example of amplitude images of DPDWs measured at the detection plane
using a 680 nm laser, scanned with 32x32 pixels over an 8 cm x 8 cm area of a phantom containing one black
object. The phantom was 4.7 cm thick, and the black bead, 9 mm in diameter, was imbedded at (0, 0, 2.3
cm). The illumination spot on the phantom was located at (-1.5 cm, 0, 2.3 cm), 1.5 cm off center along the x
axis with respect to the center of the phantom. The modulation frequency used for this set of
measurements was 20 MHz, and the filter used for the reconstruction was 2. The absorption and reduced
scattering coefficients of the background medium were 0.14±0.06 and 3.1±1.4 cm^-1, respectively, as
determined using a homogeneous sample. The three amplitude plots of the complex DPDWs in Figure
4 were obtained at the detection plane for the homogeneous field [4(a)] measured from the homogeneous
sample, the perturbed field [4(b)] from the inhomogeneous sample, and the deviation between the two fields
[4(c)], i.e., perturbed field (pfield) - homogeneous field (hfield). Note that in obtaining the image of
4(c), we first subtracted the two complex fields (pfield - hfield), which were calculated from the amplitude and
phase measurements of the two samples, and then plotted the amplitude of the subtracted field.


Figure 4. Amplitude images of the DPDWs measured at the detection plane from a
phantom with one hidden high-absorbing object, scanned at 20 MHz with an array
size of 32 x 32. a) homogeneous field = hfield, b) perturbed field = pfield, c) pfield -
hfield.

Figure 5 uses the same set of data as Figure 4. In the case of Figure 5, the FFT
reconstruction algorithm was used to obtain the images at different depths behind the detection plane.
The filter used here was 2. Consider first Figures 5(c) to 5(e), which are amplitude images at the
detection plane, 1.25 cm, and 2.5 cm behind the detection plane, respectively. The center of the actual
black object is 2.3 cm behind the detection surface and 1.5 cm off from the center of the scanned area in
X direction. These three images clearly and consistently show the location of the center of a hidden
inhomogeneity in the XY plane at different depths, indicating that the object is a few centimeters deep
behind the detection plane. For more quantitative localization of the object in Z direction, we need
an XZ/YZ image or a plot in Z direction at the x and y coordinates of the detected object
determined from the previous XY images. Figures 5(a) and 5(b) are cross-section images in the XZ and YZ
planes passing through the center of the scanned area. The top of Figure 5(a) corresponds to the detection
plane along the x axis, and the bottom represents the illumination plane. In this case, the XZ plane passes
through the center of the object horizontally at y=0, and one can see that the imaged object is 1.4 cm off
from the center in x direction. A plot of amplitude versus Z at the corresponding x coordinate provides
the location of the object in Z direction (not shown). The same principle applies to Figure 5(b), in which case
the YZ plane is at x=0, 1.5 cm away from the plane passing through the center of the object. Because of
this, the image does not give an accurate location of the object in Z direction.


Figure  5.  Reconstructed  images  for  a  phantom  with  one  object,  scanned  at  20  MHz  with  an  array 
size  of  32  x  32.  a)  x-z  plane:  the  plane  through  the  center  of  the  measured  volume  horizontally,  b)  y-z 
plane:  the  plane  through  the  center  of  the  measured  volume  vertically,  c)  x-y  plane:  the  detection  plane,  d) 
x-y  plane:  1.25  cm  behind  the  detection  plane,  e)  x-y  plane:  plane  near  the  object’s  center,  2.5  cm  behind 
the  detection  plane. 

Figure 6 was obtained from another set of two phantoms, one homogeneous and the
other containing two high-absorbing objects with diameters of 9 mm. With a 780 nm laser
modulated at 20 MHz, the absorption and reduced scattering coefficients of the background medium were
found to be 0.015±0.001 cm^-1 and 15.1±0.3 cm^-1, respectively. The thickness of the inhomogeneous
phantom was 4.2 cm. The two objects were imbedded inside the phantom 4 cm apart, at
(0, -2 cm, 2.1 cm) and (0, 2 cm, 2.1 cm). The phantom was set 0.5 cm off center with respect to the
laser and the scanning plane along the x axis. As in Figure 4, Figures 6(a) and 6(b) are amplitude
images measured from the homogeneous and inhomogeneous phantoms, and subtraction of the
complex fields of the two samples gives Figure 6(c). These three images were obtained at the detection
plane and clearly show two hidden objects after the simple subtraction. Furthermore, if the FFT
reconstruction algorithm is applied, we can obtain the reconstructed amplitude images of DPDWs at
different depths behind the detection plane. Figures 7(c) to 7(e) show such amplitude images
reconstructed at the detection plane, 1 cm, and 2 cm behind the detection plane, respectively. All three
images indicate consistently and unambiguously two hidden objects at about (0.5 cm, 2 cm) and (0.5 cm,
-2 cm) in the XY plane at different depths. As the reconstructed plane goes deeper and closer to the centers
of the hidden objects, the resolution of the imaged objects improves. In this case, a YZ image,
shown in Figure 7(b), supports the findings from the XY images, and it also provides the locations of the two
objects in depth, which occur at the peaks of the two dark, cylindrical spots. Since the image plane for
Figure 7(a) lies between the two objects and does not pass through the center of either one, this XZ
image does not give very useful information for locating the objects.


Figure 6. Amplitude images of the DPDWs measured from a phantom with two high-absorbing
objects, scanned at 20 MHz with an array size of 32 x 32. a) hfield, b) pfield,
and c) pfield - hfield.


Figure 7. Reconstructed images for a phantom with two objects, scanned at 20 MHz with an array size of
32 x 32 over a scan area of 8 cm x 8 cm, with Filter 2 used for the reconstruction. a) x-z plane: the plane
through the center of the measured volume horizontally, b) y-z plane: the plane through the center of the
measured volume vertically, c) x-y plane: at the detection plane, d) x-y plane: 1 cm behind the detection
plane, and e) x-y plane: plane near the objects' centers, 2 cm behind the detection plane.


Notice that there are multiple bright, ringing patterns/loops in Figure 7(b); they are artifacts of
the Fourier transform. Higher filters used in the Fourier domain for the reconstruction algorithm give more
ringing patterns. This limits the bandwidth of the spatial frequency components in the reconstruction and
thus limits the spatial resolution of the reconstructed images. If higher spatial frequency components are
included, care is needed to differentiate the real hidden objects from FFT reconstruction artifacts.
In the next section, we discuss how to select filter numbers, or cutoff spatial frequencies, to optimize
the reconstructed images. Further studies of the cause of the ringing patterns are
underway.

The phantom used to obtain Figure 8 contained two objects, one black marble of 13 mm
diameter and one white bead of 8 mm diameter. They were implanted at the middle of a 4.5 cm thick
phantom in Z direction and about 2.5 cm apart vertically (or horizontally), symmetric about the center of the
phantom. As before, 32x32 pixels were chosen to scan an 8 cm x 8 cm area with a modulation frequency of
20 MHz. The absorption and reduced scattering coefficients of the background were 0.017±0.001 cm^-1 and
15.9±0.6 cm^-1, respectively. Figures 8(a) and 8(b) are amplitude plots at the detection plane
and 2 cm behind the detection plane, near the centers of the objects, respectively. At the detection plane,
the white object is barely visible, whereas the black object is very
clear. The reconstructed image on the XY plane near the centers of the objects (Figure 8(b)),
however, displays two clear imaging spots with different magnitudes. The big, dark spot corresponds
to the black marble, and the small, light spot stems from the white bead. The
separation distance between the spots obtained from Figure 8(b) is 2.7 cm. To confirm the result, we turned the
phantom 90° around the Z axis (the axis perpendicular to the detection plane) and rescanned the phantom.
The reconstructed images for this case are shown in Figures 8(c) and 8(d), and the results are very
consistent with those observed in Figures 8(a) and 8(b).


Figure 8. Amplitude images of a phantom with two objects, a 13 mm black marble and an 8 mm white bead,
scanned at 20 MHz with an array size of 32 x 32. a) and c): at the detection (x-y) plane, b) and d): near
the center of the samples, 2 cm behind the detection plane. Filter of 2 was used for this reconstruction.
[Figure 8 panels. Columns: detection plane (8 cm x 8 cm); tumor plane (2 cm behind). Rows: the white bead
on top of the black marble; the white bead and the black marble in parallel.]

To determine how different wavelengths and powers of the laser affect the quality of the
reconstructed images, we performed two measurements on the same phantom using a 780 nm, 3 mW and
a 680 nm, 9 mW laser diode. In this case, the phantom contained two objects, one 12 mm black marble
and one 12 mm clear bead. These two objects were imbedded in the middle of a 5 cm thick phantom, and
the other parameters were the same as those used for Figure 8. The background medium had absorption and
reduced scattering coefficients of 0.016±0.00 cm^-1 and 10.0±0.7 cm^-1 at 780 nm, and 0.015±0.001 cm^-1
and 13.7±0.3 cm^-1 at 680 nm. With the 3 mW laser at 780 nm, we reconstructed amplitude images of the
phantom, shown in Figures 9(a) and 9(b), at the detection plane and at the plane passing through the centers
of the objects, 2.5 cm behind the detection plane. The clear bead does not give a
distinct, resolvable feature in either case. After we switched to the higher power (9 mW)
laser at 680 nm, the reconstructed images (Figures 9(c) and 9(d)) looked similar to those obtained with the
780 nm laser, except that the 680 nm, higher power laser gave slightly better spatial resolution.
This may be because a higher power laser gives a better signal-to-noise ratio. Overall, however, there is no
significant difference between these two lasers in imaging large-mismatched objects in our phantoms.

[Figure 9 panels. Columns: detection plane (8 cm x 8 cm); tumor plane (2 cm behind). Rows: with a 780 nm,
3 mW laser; with a 680 nm, 20 mW laser.]

Figure 9. Amplitude images of a phantom with two objects, scanned at 20 MHz with an
array size of 32 x 32. Filter is 2. a) and c): at the detection (x-y) plane; b) and d): at the
plane passing through the center of the objects.


To study the relationship between the quality of the reconstructed images and pixel spacing, we
also performed a series of scans on the same phantom with array sizes of 64 x 64, 32 x 32, and 16 x 16,
having pixel spacings of 0.125 cm/pixel, 0.25 cm/pixel, and 0.5 cm/pixel, respectively. In this case, the
phantom was 4.4 cm thick and contained an 8 mm black and a 9 mm white bead. They were located at the
middle depth of the sample and about 4 cm apart along the Y axis, with the white bead on top of the black
bead. Figure 10 presents a set of reconstructed amplitude images at both the detection plane and the
plane near the centers of the objects, 2 cm behind the detection plane. The filter used here was 1, and the
laser was at 780 nm. Figure 10 shows clearly that there is not much difference in image quality among
the reconstructed images whether the smaller (0.125 cm/pixel) or the larger (0.5 cm/pixel) pixel spacing is used.


Figure 10. Image comparisons for three different pixel spacings. a), c), e): images reconstructed
at the detection plane; b), d), f): near the tumor plane, 2 cm behind the detection plane.
Filter used here was 1.
[Figure 10 panels. Columns: detection plane (8 cm x 8 cm); tumor plane (2 cm behind). Rows: 64x64 pixels
(0.125 cm/pixel); 32x32 pixels (0.25 cm/pixel); 16x16 pixels (0.5 cm/pixel).]


To obtain the object location in Z direction, we first determine the x and y coordinates of the
detected object(s) based on XY images, such as those shown in Figure 10, and then plot the amplitude
versus Z to obtain the peak position. The z coordinate at the peak of the amplitude is the location of
the hidden object in Z direction. To illustrate this strategy, Figure 11(a) gives a Z plot based on Figure
10(d) at X_pixel = 20, Y_pixel = 6, which are the pixel coordinates of the observed dark spot for the hidden black
bead. In this case, the pixel step in Z direction is the same as that used in X and Y directions, i.e., 0.25
cm/pixel. Figure 11(a) displays a peak at pixel number 8 behind the detection plane, implying that this
peak is at 8 x 0.25 cm = 2 cm behind the detection plane. Similarly, Figure 11(b) plots the amplitude
versus Z pixels at X_pixel = 20, Y_pixel = 22, for the other observed spot, which results from the hidden white bead.
This figure shows a dip at Z_pixel = 8 behind the detection plane, giving a consistent location of the hidden
object in Z direction. A dip, rather than a peak, in amplitude might arise because the
absorption of the white bead is lower than that of the background material.
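The depth read-out described above reduces to finding the extremum of a Z profile and multiplying its pixel index by the pixel step. The following is a minimal sketch under the assumption that index 0 of the profile lies at the detection plane; the function name `locate_depth` and the Gaussian-shaped profiles are hypothetical stand-ins, not the measured data of Figure 11.

```python
import numpy as np

def locate_depth(z_profile, spacing_cm, feature="peak"):
    """Return (pixel index, depth in cm) of the peak or dip of a Z profile."""
    profile = np.asarray(z_profile, dtype=float)
    if feature == "peak":
        index = int(np.argmax(profile))
    elif feature == "dip":
        index = int(np.argmin(profile))
    else:
        raise ValueError("feature must be 'peak' or 'dip'")
    return index, index * spacing_cm

# Synthetic profiles sampled at 0.25 cm/pixel (illustrative only).
z = np.arange(17)
black_profile = np.exp(-0.5 * ((z - 8) / 2.0) ** 2)              # absorber: peak
white_profile = 1.0 - 0.3 * np.exp(-0.5 * ((z - 8) / 2.0) ** 2)  # low absorber: dip

black_index, black_depth = locate_depth(black_profile, 0.25, "peak")
white_index, white_depth = locate_depth(white_profile, 0.25, "dip")
print(black_index, black_depth, white_index, white_depth)
```

Whether to search for a peak or a dip has to be chosen per object, since, as noted above, the sign of the feature depends on how the object's absorption compares with the background.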


[Figure 11 horizontal axis: pixel numbers in Z from the illumination plane.]

Figure 11. Z plots along the lines perpendicular to the detection plane and
containing the black bead (a) and white bead (b). The pixel spacing in Z
direction is 0.25 cm/pixel, corresponding to that shown in Figures 10(c) and
10(d). The filter used here was 2.

To confirm the object location in depth obtained from Figure 11, we made two Z plots for the data
with the 16x16 array size, shown in Figures 10(e) and 10(f), at the corresponding x and y coordinates of
the two implanted objects. These two Z plots are given in Figures 12(a) and 12(b) for the
black and white bead, respectively. In this case, the pixel spacing in Z is 0.5 cm/pixel. One can see that a peak and a
dip appear at Z pixel number 4 behind the detection plane, i.e., 4 x 0.5 cm = 2 cm behind the detection
plane. Thus, the object location in Z direction given in Figure 12 is in good agreement with that in Figure
11.

Figure 12. Z plots along the lines perpendicular to the detection plane and containing
the black bead (a) and white bead (b). The pixel size in Z direction is 0.5 cm/pixel,
corresponding to that shown in Figures 10(e) and 10(f). The filter used here was 2.

7.  Data  Analysis 


The parameters used to scan and reconstruct images are as follows: 1) scanning dimensions/areas,
such as 8x8 or 4x4 cm^2, 2) pixel spacing, such as 0.125, 0.25, or 0.5 cm/pixel, 3) modulation frequencies
of 20, 40, or 100 MHz, and 4) filter numbers, or cutoff spatial frequencies. What is the relationship
among these parameters, and how do they affect the resolution of the reconstructed images? The answers
to these questions may help optimize the scanning and reconstruction parameters to obtain good-quality
reconstructed images as efficiently as possible.

When an FFT of a 2-dimensional spatial domain function is performed, the number of pixels (or
elements) in the spatial frequency domain is equal to the number of pixels in the spatial domain.
The 2-d spatial frequencies, fx and fy, are integer multiples of the increments Δfx and Δfy. Thus, we can rewrite the
transfer function (TF) as follows:


$$H_z(f_x, f_y) = \frac{A(f_x, f_y, z)}{A(f_x, f_y, z_0)} = \exp\left\{ i\,\Delta z \sqrt{k^2 - (2\pi f_x)^2 - (2\pi f_y)^2} \right\}$$

$$= \exp\left\{ -\Delta z \sqrt{3\mu_a\mu_s' + (2\pi n\,\Delta f_x)^2 + (2\pi n\,\Delta f_y)^2 - i\,\frac{3\mu_s'\omega}{v}} \right\} \tag{7}$$

where Δz is the distance from the detection plane to the reconstructed plane behind the detection plane,
and Δz is a positive number. n is an integer, ranging from 0 to n_max, where n_max is equal to one half of the
number of pixels in a row in the spatial domain. In the reconstruction algorithm, n corresponds to the
filter number for the cutoff spatial frequency. It is known that the increments in the 2-d spatial frequency
domain, Δfx and Δfy, are related to the dimensions of the spatial domain as:


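$$\Delta f_x = \frac{1}{\mathrm{dimension}(x)} = \frac{1}{L_x}, \qquad \Delta f_y = \frac{1}{\mathrm{dimension}(y)} = \frac{1}{L_y}. \tag{8}$$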
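Eq. (8) can be checked against the frequency grid that a discrete Fourier transform actually produces. The sketch below uses NumPy's `fftfreq` for the three scan configurations of this report; the helper name `frequency_grid` is hypothetical.

```python
import numpy as np

def frequency_grid(n_pixels, spacing_cm):
    """Return (scan length in cm, frequency increment in cm^-1, n_max)."""
    length_cm = n_pixels * spacing_cm
    freqs = np.fft.fftfreq(n_pixels, d=spacing_cm)   # cycles per cm
    delta_f = freqs[1] - freqs[0]                    # increment between FFT bins
    n_max = n_pixels // 2                            # half the pixels in a row
    return length_cm, delta_f, n_max

scans = [(64, 0.125), (32, 0.25), (16, 0.5)]         # (pixels per row, cm/pixel)
results = [frequency_grid(n, s) for n, s in scans]
for (n, s), (length, delta_f, n_max) in zip(scans, results):
    print(f"{n}x{n} at {s} cm/pixel: L = {length} cm, "
          f"delta_f = {delta_f:.3f} cm^-1, n_max = {n_max}")
```

All three configurations cover the same 8 cm, so they share the same frequency increment; the pixel spacing changes only n_max, the highest frequency index available.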


Therefore,  the  TF  becomes 


$$H_z(f_x, f_y) = \exp\left\{ -\Delta z\, B \left[ \cos\left(\frac{\theta}{2}\right) - i \sin\left(\frac{\theta}{2}\right) \right] \right\} = e^{-\Delta z B \cos(\theta/2)}\, e^{\,i \Delta z B \sin(\theta/2)} \tag{9}$$

where B is a real number,

$$B = \left\{ \left[ 3\mu_a\mu_s' + \left(\frac{2\pi n}{L_x}\right)^2 + \left(\frac{2\pi n}{L_y}\right)^2 \right]^2 + \left(\frac{3\mu_s'\omega}{v}\right)^2 \right\}^{1/4} \tag{10}$$

and

$$\theta = \tan^{-1}\left[ \frac{3\mu_s'\omega/v}{3\mu_a\mu_s' + (2\pi n/L_x)^2 + (2\pi n/L_y)^2} \right]. \tag{11}$$

Eqs. (9) to (11) show the quantitative dependence of the TF on the spatial dimensions, Lx and Ly, the modulation
frequency, ω, and the number of cutoff pixels, n, for the spatial frequencies. It is seen from eq. (9) that the TF, Hz, has a
decaying amplitude of exp[-Δz B cos(θ/2)] with a phasor of exp[i Δz B sin(θ/2)]. For a given phantom or tissue with fixed
values of the absorption and reduced scattering coefficients, μa and μs', having typical values of 0.1 and 10
cm^-1, B and θ, and thus Hz, are functions of ω and n once the scan dimensions, Lx and Ly, are selected.
Figure 13 demonstrates that θ/2 is relatively small for modulation frequencies below 100
MHz. This means that the transfer function is approximately a pure attenuation function, with the phasor
equal to 1, when the modulation frequencies are within 100 MHz. Using the same parameters as those for
Figure 13, we plot the amplitude exp[-Δz B cos(θ/2)] with θ = 0, as shown in Figure 14. It is seen that at lower
modulation frequencies, such as 20 MHz, the amplitude of the transfer function decays much faster than
at higher modulation frequencies, such as 1 GHz. This plot also shows that to obtain the same
magnitude of the normalized amplitude of the TF, the number of pixels for Δf must be much smaller at 20
MHz than at 1 GHz. The cutoff filter in the spatial frequency domain is therefore more limited at
lower modulation frequencies than at higher modulation frequencies, i.e., only 1 or 2 pixels at 20
MHz but up to 5 or more pixels at 1 GHz. In addition, based on eq. (10), the cutoff filters can be
equivalently improved by increasing the scanning dimensions, Lx and Ly.
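The behavior described for Figures 13 and 14 can be reproduced approximately from the magnitude and angle of the complex quantity under the square root of the transfer function. The sketch below is illustrative only: it assumes B is the fourth root of the sum of squares of the real and imaginary parts, takes ω/v as 0.006 cm^-1 at 20 MHz and 0.3 cm^-1 at 1 GHz (the values quoted later in this section), assumes Δz = 1 cm, and normalizes the amplitude to its n = 0 value. The function names are hypothetical.

```python
import numpy as np

def transfer_terms(n, omega_over_v, mu_a=0.1, mu_s_prime=10.0, lx_cm=8.0, ly_cm=8.0):
    """Magnitude B (cm^-1) and angle theta (rad) for filter number n."""
    real_part = (3.0 * mu_a * mu_s_prime
                 + (2 * np.pi * n / lx_cm) ** 2 + (2 * np.pi * n / ly_cm) ** 2)
    imag_part = 3.0 * mu_s_prime * omega_over_v
    b = (real_part ** 2 + imag_part ** 2) ** 0.25
    theta = np.arctan2(imag_part, real_part)
    return b, theta

def normalized_amplitude(n, omega_over_v, dz_cm=1.0, **medium):
    """TF amplitude exp(-dz * B * cos(theta/2)), divided by its n = 0 value."""
    b, theta = transfer_terms(n, omega_over_v, **medium)
    b0, theta0 = transfer_terms(0, omega_over_v, **medium)
    return np.exp(-dz_cm * (b * np.cos(theta / 2) - b0 * np.cos(theta0 / 2)))

n = np.arange(0, 9)                                  # filter numbers 0 to 8
half_angle_20mhz = np.degrees(transfer_terms(n, 0.006)[1] / 2)
amp_20mhz = normalized_amplitude(n, 0.006)
amp_1ghz = normalized_amplitude(n, 0.3)
print(half_angle_20mhz.max(), amp_20mhz[2], amp_1ghz[2])
```

With these assumptions, θ/2 stays below about 2 degrees at 20 MHz, and for every n > 0 the normalized amplitude at 1 GHz is larger than at 20 MHz, which is the trend the text attributes to Figure 14.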


Figure 13. Dependence of θ/2 in degrees on the number of pixels for Δf in the spatial frequency domain.
The parameters used for this plot are: μa = 0.1 cm^-1, μs' = 10 cm^-1, Lx = Ly = 8 cm.
[Horizontal axis: number of pixels for Δf, 0 to 20.]

Figure 14. Dependence of the normalized amplitude of the transfer function on the number of pixels for Δf
in the spatial frequency domain. The parameters used for this plot are: μa = 0.1 cm^-1,
μs' = 10 cm^-1, Lx = Ly = 8 cm.
[Horizontal axis: number of pixels for Δf (filter #), 0 to 14.]


Thus, to improve the resolution of the reconstructed images, we can use higher modulation
frequencies or increase the size (dimensions) of the detection plane. Decreasing the pixel spacing is not
a crucial factor at this point, as long as the sampling frequency (number of pixels per unit length) is
larger than the spatial frequency of the imbedded object, i.e., 1/(2 x diameter of the object), according to
the Nyquist sampling criterion.

To confirm the findings mentioned above, we have also conducted theoretical simulations using
similar parameters. The results are very consistent with the experimental data.

For biomedical imaging, consider two extreme cases. The first case is
where the absorption of the medium is high in comparison to the modulation frequency. For example, at 20
MHz, we have μa (typically 0.1 to 0.3 cm^-1) ≫ ω/v (= 0.006 cm^-1 for a 20 MHz signal), and the TF becomes a
purely exponential decay function. We can approximately use the following relationship to optimize the
parameters for the cutoff filter and the scanning dimensions, n, Lx, and Ly:

(2πn/Lx)^2 + (2πn/Ly)^2 = 3μaμs' ≈ 3 cm^-2, or n/Lx ≈ 0.2 cm^-1.
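The quoted ratio follows directly from the relationship above if a square scan, Lx = Ly = L, is assumed (an assumption made here only for the arithmetic), with the typical values μa = 0.1 cm^-1 and μs' = 10 cm^-1:

```latex
\begin{aligned}
2\left(\frac{2\pi n}{L}\right)^{2} &= 3\mu_a\mu_s' \approx 3\ \mathrm{cm^{-2}} \\
\frac{n}{L} &= \frac{1}{2\pi}\sqrt{\frac{3}{2}}\ \mathrm{cm^{-1}} \approx 0.19\ \mathrm{cm^{-1}}
\end{aligned}
```

For the 8 cm scans used in this report, this corresponds to n ≈ 1.6, which is in line with the filter values of 1 and 2 used for the 20 MHz reconstructions.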

In the second case, where the modulation frequency is comparable to or higher than the absorption factor
of the medium, we have μa (typically 0.04 to 0.1 cm^-1) ≪ ω/v (= 0.3 cm^-1 for a 1 GHz signal). The TF
then has an exponential decay term and a phase shift. In this case, we use the following relationship to
estimate the parameters for the cutoff filter and the scanning dimensions, n, Lx, and Ly:

(2πnΔfx)^2 + (2πnΔfy)^2 = 3μs'ω/v ≈ 3π cm^-2, or nΔfx ≈ 0.6 cm^-1.

8.  Conclusions  and  Future  Work 

What we have learned from this summer research project is as follows:

1) the FFT reconstruction algorithm works in imaging and positioning inhomogeneities
3-dimensionally with good accuracy;

2) background information is necessary and important for accurate reconstruction;

3) at the present stage, for high sensitivity, the imbedded object should be off center with respect to
the light source and the detection plane; further studies are underway to improve the
reconstruction algorithm in order to eliminate this requirement;

4) a large scanning dimension/area is more important than a small scanning pixel size for achieving better
resolution;

5) high modulation frequencies (>500 MHz) may be helpful to achieve better resolution; low
modulation frequencies can provide adequate information for tumor screening;

6) this summer project has demonstrated that the experimental set-up works and that the results
generated from it are in good agreement with the theoretical simulations. Inhomogeneities can be
detected and located in three dimensions. However, we are not yet able to characterize the optical
properties of the inhomogeneities. This remains for further study.

Our future work includes:

1) make phantoms containing objects that are small, purely absorptive perturbations from the
background medium;

2) take measurements on the new phantoms;

3) complete a diffraction tomographic model for quantitative reconstruction of optical property
imaging;

4) apply the diffraction tomographic model to the data taken from the new phantoms for μa, μs'
image reconstruction;

5) extend the modulation frequencies to the kHz range, and explore the possibility of using CCD
cameras for image capture;

6) extend the system to clinical tests.

THERMOELECTRIC ENERGY CONVERSION
WITH SOLID ELECTROLYTES


M.A.K.  Lodhi 
Professor 

Department  of  Physics  and  Engineering  Physics 
Texas  Tech  University 


Abstract 


The Alkali-Metal Thermal-to-Electrical Converter (AMTEC) is a high temperature regenerative concentration
cell which converts thermal energy directly into electrical energy. The efficient operation of an AMTEC cell
involves several challenging heat and mass transfer problems, which have been investigated. The AMTEC cell
consists of several sodium vapor tubes. The pressure drop is calculated for each node in order to calculate
the total pressure drop through the tube. The flow equations are derived in the continuum, free molecular
motion, and transition regimes. As expected, the molecular flow regime is more challenging. We studied
this regime with the Dusty Gas Model (DGM). This approach is expected to remove the discrepancy between
the observed data and the results predicted by this model. Some alternatives to sodium as working fluids have
been surveyed. Potassium seems to be a viable candidate.

1.  Introduction 

Direct  conversion  of  heat  energy  to  electrical  energy — that  is  conversion  without  the  use  of  moving 
mechanical  parts — has  both  aesthetic  and  practical  appeal.  Several  devices  for  thermoelectric  direct 
conversion  have  been  developed.  The  Seebeck  effect,  thermionic  and  magnetohydrodynamic  generators 
are  familiar  examples.  None  of  these  direct  converters  has  been  widely  adopted  because  of  practical 
problems  such  as  parasitic  heat  loss  or  lack  of  a  critical  material  with  the  physical  properties  necessary  for 
high  efficiency  and  good  durability. 

Among the less well known direct thermoelectric converters are the thermally regenerative
electrochemical systems (TRES). These devices are closed electrochemical systems which include
electrochemical cells that produce electrical power. The reactants for these cells are regenerated
within the device by thermal energy from a heat source which flows through the device to a heat
sink. TRES have also been called electrochemical heat engines, by analogy with the well-known heat
engines in which a working fluid is carried around a thermodynamic cycle. They are Carnot-limited
in efficiency. Many types of TRES were investigated in the 1950s and 1960s in the search for
improved methods of converting the thermal output of nuclear reactors to electrical power.

All of the early TRES were plagued by practical problems such as inefficient heat exchange,
electrode polarization, slow chemical regeneration kinetics, materials separation problems and
corrosion. Power densities of these early systems were usually limited to a few tens of milliwatts
per square centimeter of electrode area, and thermoelectric efficiencies were below 5%. Some
comprehensive review articles on TRES are now available in the literature. The development of
beta-alumina solid electrolytes has made possible a new TRES, the alkali metal thermoelectric
converter, which is capable of high power and high efficiency and avoids the practical difficulties
of the earlier systems.

With the advent of Faraday's law of electromagnetic induction and the steam engine, dynamic
conversion systems evolved rapidly and were perfected with engineering details mostly for
terrestrial and aircraft devices. As a potential alternative to dynamic conversion systems, static
thermal-to-electrical conversion systems have been investigated, particularly for space power
systems. Since the discovery of the Seebeck effect, the thermoelectromotive force, which occurs in
materials under the effect of a temperature gradient, has been the driving force for static
thermoelectric generators. A large number of static devices have been developed and studied. The
goal for static conversion systems has been to identify static converters with efficiencies that
are competitive with dynamic systems. To this end, NASA Lewis Research Center (LeRC) currently has
an interest in developing thermal energy storage (TES) for the solar dynamic ground test
demonstration (SDGTD) program. Phillips Laboratory has an interest in developing an electrical
power system for a payload in Low Earth Orbit (LEO) for a duration of 10 years. For conversion
technology, the Jet Propulsion Laboratory (JPL) is currently interested in alkali metal
thermal-to-electrical conversion (AMTEC) with respect to the Fast Flyby Pluto Express Mission. In
all these endeavors the effort is aimed at developing advanced power conversion and thermal
management and control subsystems for space applications. As part of this effort, an Alkali-Metal
Thermal-to-Electric Converter (AMTEC) device is being developed and tested at USAF Phillips Lab
with the goal of providing feedback for improved efficiency, durability, and lifetime. To reach
this goal, thorough and accurate mathematical modeling and analysis, as well as experimental
testing of the AMTEC device, are required. Our efforts on the theoretical side this summer have
therefore been to develop an accurate mathematical model of AMTEC at Phillips Lab.

2.  Background  of  AMTEC 

AMTEC is a relatively new type of device, based on the principle of a sodium concentration cell,
conceived in the late sixties [1,2]. AMTEC devices are a leading power conversion candidate for a
large number of applications, particularly low mass, low cost, long life, and high-efficiency power
systems for satellites and exploratory space probes. AMTEC systems are very attractive for these
applications, because they are static conversion devices which can provide efficiencies close to
the theoretical Carnot efficiency. Efficiencies as high as 19% have been measured in laboratory
experiments. Optimized AMTEC devices of a mature design can provide theoretical conversion
efficiencies between 20 and 40% when operating at a moderate hot-side temperature between 900 K and
1300 K and a cold-side temperature between 400 K and 800 K. Other advantages of AMTEC devices
include low maintenance, high durability, high specific power, modular construction, and the
ability to operate in conjunction with a high-temperature combustion, nuclear or solar heat source.
Therefore, AMTEC devices are attractive not only for space, but also for many terrestrial and
aerospace applications.
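
The efficiency figures quoted above can be put in context with the Carnot limit, 1 - Tc/Th. The short Python sketch below evaluates that bound at the corners of the temperature ranges given in the text. The function name is illustrative only and is not part of the AMTEC model described in this report.

```python
def carnot_efficiency(t_hot_k: float, t_cold_k: float) -> float:
    """Carnot limit 1 - Tc/Th for absolute temperatures in kelvin."""
    if t_cold_k <= 0 or t_hot_k <= t_cold_k:
        raise ValueError("require 0 < t_cold_k < t_hot_k")
    return 1.0 - t_cold_k / t_hot_k


if __name__ == "__main__":
    # Corners of the hot-side (900-1300 K) and cold-side (400-800 K) ranges.
    for t_hot, t_cold in [(900.0, 400.0), (900.0, 800.0),
                          (1300.0, 400.0), (1300.0, 800.0)]:
        eta = carnot_efficiency(t_hot, t_cold)
        print(f"Th = {t_hot:.0f} K, Tc = {t_cold:.0f} K: Carnot limit {eta:.1%}")
```

The bound depends only on the two reservoir temperatures, so it says nothing about the parasitic losses discussed later; it is only an upper limit against which measured efficiencies can be compared.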

2A. Operation Principles of AMTEC Cell

To understand the working of AMTEC, its operating cycle is illustrated in Fig. 1 [3]. A closed
vessel is divided into a high temperature (T2) and high pressure (P2) region in contact with a heat
source and a low temperature (T1), low pressure (P1) region in contact with a heat sink. AMTEC is a
high-temperature regenerative concentration cell for elemental sodium, which directly converts heat
to electrical energy. The critical material in the operation of the AMTEC is the beta″-alumina
solid electrolyte (BASE), a sodium-ion conductor whose ionic conductivity is much larger than its
electronic conductivity. In an AMTEC cell, the BASE separates the hot (high-pressure) region, filled
with a small quantity of liquid sodium (typically <10%) in contact with the heat source, from the
cold (low-pressure) region occupied by sodium vapor. A porous electrode covers the low-pressure
(outer) side of the BASE. Electrical leads in contact with the porous electrode and the
high-temperature liquid sodium exit through the wall of the device and are connected to an external
load. The pressure differential across the BASE forces ionization of sodium atoms on the hot side.
The ions diffuse through the BASE toward the low-pressure side in response to the pressure
differential (a gradient in Gibbs free energy), while the electrons circulate through the external
load, producing electrical work [4]. Electrons and sodium ions recombine at the interface between
the BASE and the porous electrode. The resulting sodium atoms absorb their heat of vaporization,
move through the electrode and the vapor space, then release their heat of vaporization at the
low-temperature condenser surface. A wick structure or an electromagnetic pump returns the liquid
condensate to the high temperature side of the BASE to complete its circulation. More details on
AMTEC can be found in the sparse literature [1-6]. AMTEC possesses conversion efficiencies much
higher than other direct thermoelectric devices. An efficiency of 19% was reported in 1981 [7],
with a power density of 1 W/cm2 [6]. Its hot region temperature is in the range of 900-1300 K [8].
This makes it compatible with the general purpose heat source (GPHS) [9] or the projected SP-100
nuclear space power reactor [10]. AMTEC is limited to 1350 K because of sodium interactions with
the BASE. The lower temperature region is limited to 500 K by the need to maintain the sodium in
the liquid state. As the condenser temperature increases above 700 K, the efficiency decreases
[11]. In addition, AMTEC has high efficiency near Carnot [12], low maintenance and high durability,
modular construction and the ability to use a high temperature combustion, nuclear, or solar heat
source [13].
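
Because the cell is described above as a sodium concentration cell driven by the pressure ratio across the BASE, its ideal open-circuit voltage can be estimated with the textbook Nernst-type expression V = (R T2 / F) ln(P2 / P1). This expression is not stated in the report and is used here as an assumption. It neglects electrode polarization, BASE resistance and all parasitic losses. The function name and the example pressures are hypothetical.

```python
import math

R_GAS = 8.314462618    # molar gas constant, J mol^-1 K^-1
FARADAY = 96485.33212  # Faraday constant, C mol^-1


def ideal_open_circuit_voltage(t_hot_k: float, p_high: float, p_low: float) -> float:
    """Idealized concentration-cell voltage (R*T2/F) * ln(P2/P1).

    Assumes one electron per sodium ion and isothermal expansion at the
    BASE temperature t_hot_k. p_high and p_low must share the same unit.
    """
    if t_hot_k <= 0 or p_high <= 0 or p_low <= 0:
        raise ValueError("temperature and pressures must be positive")
    return R_GAS * t_hot_k / FARADAY * math.log(p_high / p_low)


if __name__ == "__main__":
    # Hypothetical pressure ratio of 1000 at a BASE temperature of 1100 K.
    print(f"{ideal_open_circuit_voltage(1100.0, 1000.0, 1.0):.3f} V")
```

The logarithmic dependence is the point of the sketch: a thousandfold pressure ratio yields well under one volt per cell.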

3.  Heat  and  Mass  Flow 

The operation of an AMTEC cell involves several challenging heat and mass transfer processes
requiring thorough investigation, namely: (a) the diffusion of sodium ions through the BASE tube;
(b) the evaporation of sodium at the BASE-porous electrode interface; (c) the sodium vapor flow
through the porous electrode and the vapor space in the free-molecular regime; and (d) the
recirculation of liquid sodium through the capillary tube. Any of these processes can limit the
operation of the AMTEC cell. Because sodium is solid at room temperature, the startup of the AMTEC
cell from a frozen state and the effect of the initial distribution of the sodium working fluid in
the cell are of particular concern.

The performance and the conversion efficiency of an AMTEC cell depend on the internal and external
parasitic heat and electrical losses. The internal heat losses are made up of conduction through
1) the output current leads and 2) the supporting structure of the BASE tube, and of 3) internal
radiation heat losses from the porous electrode surface to the condenser surface (through the vapor
space). The latter depends on a) the geometry, b) the axial temperature distribution in the
condenser shell and c) the emissivity of both surfaces. 4) Of particular importance is the fraction
of the condenser surface covered with the sodium condensate. If the porous electrode and condenser
surfaces exchange radiative energy at rates comparable to those of ideal black bodies, the maximum
efficiency of the cell would be less than 10%. Fortunately, the reflectivity of the sodium liquid
film is greater than 98% in the infrared. Thus, if the surface of the film is smooth, the radiative
losses could be as low as 2% of black body emission, even if the porous electrode is an ideal
emitter. To take advantage of the high reflectivity of the sodium condensate, the shape of the
condenser surface must be designed to reflect most of the radiation emanating from the porous
electrode back to it.
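
The 2% figure can be checked with the standard gray-body exchange formula for two large parallel surfaces, q = sigma (T1^4 - T2^4) / (1/e1 + 1/e2 - 1). The sketch below is an illustration under stated assumptions, not the model used in the report: the parallel-plate geometry ignores the view-factor problem, the condenser emissivity is taken as one minus the quoted reflectivity, and the function name and temperatures are hypothetical.

```python
SIGMA = 5.670374419e-8  # Stefan-Boltzmann constant, W m^-2 K^-4


def net_radiative_flux(t_emitter_k: float, t_condenser_k: float,
                       eps_emitter: float, eps_condenser: float) -> float:
    """Net flux (W/m^2) between two large parallel gray surfaces."""
    for eps in (eps_emitter, eps_condenser):
        if not 0.0 < eps <= 1.0:
            raise ValueError("emissivity must lie in (0, 1]")
    # Series radiative resistance of the two facing surfaces.
    resistance = 1.0 / eps_emitter + 1.0 / eps_condenser - 1.0
    return SIGMA * (t_emitter_k**4 - t_condenser_k**4) / resistance


if __name__ == "__main__":
    t_hot, t_cold = 1100.0, 600.0  # hypothetical electrode and condenser temperatures
    black = net_radiative_flux(t_hot, t_cold, 1.0, 1.0)
    # Ideal emitter facing a smooth sodium film with 98% reflectivity.
    mirror = net_radiative_flux(t_hot, t_cold, 1.0, 0.02)
    print(f"black-body exchange: {black:.0f} W/m^2")
    print(f"with sodium film:    {mirror:.0f} W/m^2 ({mirror / black:.1%})")
```

With an ideal emitter (emissivity 1) the resistance term reduces to 1/e2, so the exchange scales directly with the condenser emissivity, which is the behavior the paragraph describes.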

The external heat losses are made up of 5) conduction through the support structure of the
condenser (or the outer shell of the cell), and 6) radiation from the cell outer shell to the
shield surrounding the cell. 7) Heat leakage through the radiation shield may also be important.
(In the case of a system comprised of several AMTEC cells, radiation exchange between the shells of
the different cells must be considered.) Due to the high operating temperature of the AMTEC,
radiation is a major heat transfer mechanism. 8) A major challenge involved in calculating
radiation heat transfer is determining the geometric view factor. 9) Except for the sodium vapor,
there is no convection heat loss. The sodium vapor heat transfer through convection only affects
the internal degrees of freedom of the AMTEC.

4.  Multi-tube  Model  Code 

Two  different  types  of  cells  are  being  considered  for  testing,  single-cell  units  and  multi-cell  units  for  the 
Pluto  Express  Mission.  For  the  purpose  of  this  research  period  we  were  primarily  concerned  with  the 
multi-tube  unit. 

We formulated a mathematical model of AMTEC which takes into consideration all the aspects
discussed in Section 3, to support the design, fabrication, testing and further improvement of a
prototype cell. The three flow regimes (continuum, transition and free molecular) are delimited,
somewhat arbitrarily, by the Knudsen number, Kn, as follows [14]:

(a) Continuum (viscous):          Kn < 0.01
(b) Slip (transition):            0.01 < Kn < 1
(c) Free molecular (molecular):   Kn > 1.

This requires dividing the AMTEC cell under investigation, as accurately as possible, into parts
where each of these regimes dominates, and deriving the equations necessary to calculate the vapor
flow pressure drop in each regime.

This model has four options for the effect of the molecular collisions with the wall, particularly
in the rarefied flow regime. In order to incorporate the effect of molecule-wall collisions more
realistically, the so-called Dusty Gas Model (DGM) is used as the fourth approach [15-18]. The DGM
treats this effect as a diffusion problem involving the original gas molecules, in motion, and the
giant wall molecules, at rest and uniformly distributed. The problem is thus formally treated as
the one component flow of a multicomponent gas mixture.
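
The Knudsen-number limits listed in this section translate directly into a small classifier, sketched below in Python. It covers only the regime selection, not the pressure-drop equations or the DGM. The function names are illustrative, and assigning the boundary values Kn = 0.01 and Kn = 1 to the transition regime is an assumption, since the limits in the text are strict inequalities.

```python
def knudsen_number(mean_free_path_m: float, char_length_m: float) -> float:
    """Kn = mean free path / characteristic flow dimension (same units)."""
    if mean_free_path_m < 0 or char_length_m <= 0:
        raise ValueError("lengths must be positive")
    return mean_free_path_m / char_length_m


def flow_regime(kn: float) -> str:
    """Classify a flow by Knudsen number using the limits quoted in the text."""
    if kn < 0:
        raise ValueError("Knudsen number cannot be negative")
    if kn < 0.01:
        return "continuum"
    if kn <= 1.0:  # boundary values placed in the transition regime (assumption)
        return "transition"
    return "free molecular"


if __name__ == "__main__":
    for kn in (0.001, 0.1, 10.0):
        print(f"Kn = {kn:g}: {flow_regime(kn)}")
```

A node-by-node pressure-drop calculation would call such a classifier once per node to pick the applicable flow equation.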

A computer code using TK Solver has been developed for heat and mass flow which calculates the
internal electrical and thermal losses, and predicts the multi-tube AMTEC performance. The pressure
drop is calculated using the DGM. Chevron-type shielding is an available option. The vapor is
assumed to vary linearly. A set of input and output parameters on the variable sheets is given in
the Appendix.

5.  Alternate  Working  Fluid 

To look for a viable alternative to sodium as the working fluid, we have collected all those
elements which have a negative electrode potential with reference to the hydrogen electrode. They
are given in the accompanying table. Though lithium has the most negative potential, -3.05 V, and
is the lightest metal, it has high melting and boiling points.

As a working fluid, potassium (K) is a viable alternative to sodium. K-ion-conducting beta″-alumina
(K-BASE) has been made [19], though its conductivity is not known at AMTEC operating temperatures.
As with optimized electrodes, analyzing the module performance with K working fluid (a K-module)
requires several assumptions. First, the resistance RBASE of K-BASE probably differs from that of
Na-BASE, and K-BASE is probably not as good a conductor. The K-BASE conductivity is assumed to be
one half the Na-BASE conductivity. Similarly, the exchange current, J0, is probably smaller; we use
a value that is one half the J0 for Na. Finally, the calculations must be corrected for such
fluid-specific values as the vapor pressure, the molecular weight, the heat capacity, and the
latent heat of vaporization.
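
The two scaling assumptions above (half the BASE conductivity, half the exchange current) can be captured in a small parameter record, sketched here in Python. This is an illustration only and not the TK Solver model. The class and field names are hypothetical, conductivity and exchange current are normalized to the sodium values, and the remaining fluid-specific properties (vapor pressure, heat capacity, latent heat) are deliberately left out because they would have to come from property data.

```python
from dataclasses import dataclass, replace


@dataclass(frozen=True)
class ModuleAssumptions:
    fluid: str
    molar_mass_g_mol: float
    base_conductivity_rel: float  # BASE ionic conductivity, Na-BASE = 1.0
    exchange_current_rel: float   # exchange current J0, Na value = 1.0

    @property
    def base_resistance_rel(self) -> float:
        """Relative BASE resistance, taken as inverse conductivity for fixed geometry."""
        return 1.0 / self.base_conductivity_rel


# Sodium reference case; relative quantities are 1.0 by definition.
SODIUM = ModuleAssumptions("Na", 22.98977, 1.0, 1.0)


def potassium_from_sodium(na: ModuleAssumptions) -> ModuleAssumptions:
    """Apply the stated assumptions: half the conductivity and half the J0."""
    return replace(
        na,
        fluid="K",
        molar_mass_g_mol=39.0983,
        base_conductivity_rel=0.5 * na.base_conductivity_rel,
        exchange_current_rel=0.5 * na.exchange_current_rel,
    )


if __name__ == "__main__":
    print(potassium_from_sodium(SODIUM))
```

Keeping the record immutable makes it harder to mix sodium and potassium values by accident when comparing the two modules.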

A K-module is expected to outperform a similar Na-module at lower hot zone temperatures because K
has a higher vapor pressure (PK) than Na (PNa). However, at higher temperatures, the change in PNa
with temperature is greater than the change in PK, and the Na-module performance overtakes the
K-module performance. For the tube bundle operating between 1000 K and 500 K, the change from Na to
K working fluid increases the peak power from 200 to 250 W/l and improves the efficiency from 29 to
34%. For hot zone temperatures greater than about 1100 K, the Na-module outperforms the K-module.
For the flat plate device, the K-module performance is greater than the Na-module even at 1200 K
because of the thinner ionic conductor. The conversion efficiency of the flat plate K-module is 4
to 9% better than the Na-module over the temperature range, with an efficiency of 35% at 1100 K.
The peak power approximately doubles in the low temperature range (800-1000 K), and at 1100 K the
K-module still has a 27% greater power than the Na-module.

6.  Conclusion 


A multi-tube AMTEC system has been designed and a code in TK Solver has been prepared with the DGM.
The input-output parameters are given on the variable sheets. An alternate working fluid to sodium
has been investigated. It seems that developing a K-electrolyte and using potassium as the working
fluid could improve both efficiency and power density for AMTEC.

APPENDIX


These are the parameters that you would need to change if you were
modeling a different working fluid in the AMPS TK model

(1)  On  the  input  sheet: 


Qs  -  sensible  heat  of  working  fluid 
Ql  -  latent  heat  of  working  fluid 
rNa  -  electrical  resistivity  of  Na 
^Nav  -  thermal  conductivity  of  Na  vapor 
RhoNa  -  Sodium  density 
eNa - emissivity of sodium
M  -  sodium  molecular  weight 


(2)  On  the  rule  sheet: 

under "; Saturation Pressure" the correlation for the saturation
pressure needs to be changed for the calculation of Pc, Pa, and Pb
to the correct working fluid correlation

(3)  On  the  function  sheet: 

the function "ut" needs to be changed to a viscosity correlation
for the new working fluid (the current one is for sodium)

Study of Nonlinear Dynamics in a Diode Pumped Nd:YAG laser


Tim  Newell 

University  of  New  Mexico 
Abstract 


Nonlinear dynamics are investigated in a specially designed and
constructed pump modulated Nd:YAG laser. The laser operates in a single
longitudinal and transverse mode as well as in one polarization eigenmode.
Investigations reveal that a chaotic state can be obtained. The route into
chaos is not characterized by a sequence of period doubling cascades but rather
by an apparent crisis. Coexisting attractors are also observed.


Introduction 

Traditional  studies  of  chaotic  dynamics  in  experimental  systems  have 
usually  focused  on  electronic  circuits  {1...4}  or  mechanical  nonlinear 
oscillators  {5}.  The  advantage  of  such  systems  is  typically  a  low  dimensional 
phase  space  orbit  (the  attractor),  the  minimal  amount  of  system  noise,  and  the 
ready  access  to  the  state  variables.  In  contrast,  fundamental  investigations  of 
nonlinear  dynamics  in  optical  systems  have  not  been  pursued  intensely  due 
to  their  usually  high  dimensional  attractors  as  well  as  relatively  large 
amounts  of  noise  {6}.  However,  optical  systems  have  certain  advantages. 
Semiconductor lasers, in particular, intrinsically possess very large
bandwidths, which is particularly appealing for communication
purposes. Unfortunately, their high oscillation frequencies mean that
intensity time series can only be acquired in real time for very short
intervals. Hence detailed analysis of the signal cannot be made using time
domain measurements.

The purpose of this investigation is to study nonlinear dynamics in an
optical system as a prelude to active control of chaotic behavior. In this
project, nonlinear dynamics is studied in a neodymium doped
yttrium-aluminum-garnet laser (Nd:YAG). Due principally to the long fluorescence
lifetime of the excited atomic levels (τf = 240 μs), the fundamental relaxation
oscillation frequency of this device is typically less than 100 kHz.
Consequently, long intensity time traces of this laser may be captured and
subsequently examined using techniques in chaotic data characterization {7}.
Furthermore, this laser, when operating in a single longitudinal and single
transverse mode as well as in a single polarization eigenstate, can be modeled
using two ordinary differential equations {8}. This allows for concurrent
theoretical and experimental studies and quantitative comparison between
the two.

The  Nd:YAG  laser 

Historically  Nd:YAG  lasers  have  been  pumped  using  rare  earth 
containing  arc  lamps  or  tungsten-filament  lamps  {9}.  However,  due  to  the 
large  thermal  load  and  small  efficiency  level,  these  sources  place  great 
demands  on  the  engineering  of  the  laser  design.  In  contrast,  laser  diode 
pumping  of  the  cavity  is  preferable  due  to  excellent  spectral  matching  with 
the  neodymium  absorption  bands,  good  mode  matching  with  the  Nd:YAG 
laser  field,  and  reduced  thermal  load  on  the  crystal.  Additionally, 
modulation  of  the  laser  diode's  intensity  is  a  simple  matter  of  modulating  its 
driving  current. 

The experimental arrangement is shown in Fig. 1. The laser diode
pump (SDL-5410-G), LD, produces a single longitudinal and single transverse
mode beam at λ = 810 nm. In spite of the highly elliptical beam, good mode
matching between the pump and YAG cavity can be achieved if the
arrangement is carefully aligned. Additionally, an anamorphic prism may be
incorporated into the design to improve the mode matching. Unfortunately,
even minute reflections from different optical components were sufficient to
stimulate nonlinear behavior in the diode laser's output. The consequences
of such feedback are mode hopping and high frequency oscillations. The
suppression of this effect is discussed below. A Faraday
isolator  (Optics  for  Research  NIR-5),  FI,  measured  at  39.6  dB  isolation  follows 
the collimating lens (Newport FL-10B), CL. After the isolator, an acousto-optic
modulator, AOM, was sometimes used to modulate the pump by
deflecting the light sinusoidally. In the course of the experiments, it was
found that the current modulated diode pump was noisier than the
current stabilized laser diode. Also, the acousto-optic modulator was not
limited to a useful frequency range of 50 kHz as was the laser diode's current
source. An iris, I, and an f = 75 cm focusing lens, FL, were placed between the
AOM and the YAG crystal, Y. The crystal measures 5 mm x 2 mm. The
absorption efficiency at 810 nm is over 80% {10}. The laser cavity is in a
hemispherical configuration. The input side is formed by the YAG crystal. It
is HR at the Nd:YAG lasing wavelength of 1064 nm and HT at the pump
wavelength. A Brewster window, B, was sometimes placed inside the cavity.
Its purpose is to ensure oscillation of only one polarization mode. However, its
use required a larger cavity size, meaning that more than one longitudinal
mode could lase for large pump intensities. Without this window,
the cavity length could be reduced to less than 7 mm (hence the FSR of the
cavity is almost 12 GHz). In this case, it was found that with proper mirror
tilting, not only would the laser be single longitudinal mode, but it was also
single polarization for low pumping powers. A curved output
coupling mirror, OC, with R=98% completed the lasing cavity. The YAG light
was collimated and stray pump light was blocked using a high bandpass filter.
Another  Faraday  isolator,  FI,  was  employed  to  prevent  reflections  back  into 
the  cavity  from  the  Fabry-Perot  interferometer,  FP.  A  beamsplitter  directed  a 
fraction  of  the  YAG  light  into  a  New  Focus  1811  125  MHz  photodetector.  The 
electrical  signal  was  then  conditioned  using  a  Stanford  Research  Systems 
SR-560  low  noise  preamplifier  then  input  into  a  Bittware  BAT-301  digitizer 
(see  the  Appendix)  and  a  Tektronix  DSA-601  oscilloscope. 

The laser is operated close to threshold in order to prevent multiple
longitudinal modes, as well as two polarizations, from lasing. Depending on
the alignment and mode matching of the laser diode pump, high order
transverse modes can oscillate, as well as other curiously shaped far field
intensities. However, under optimal alignment, the first Gaussian mode
dominates. The relationship between the pump and relaxation oscillations
follows the expected curve of a 4-level laser {11}, namely

$$\omega_r^2 = \frac{1}{\tau\,\tau_f}\left(\frac{P}{P_{th}} - 1\right) \qquad (1)$$

where ωr is the relaxation oscillation frequency, P is the pump, Pth is the pump at
threshold, τf is the fluorescence lifetime, 240 μs, and τ is the photon lifetime.
Figure 2 shows this relation between pumping and the square of the
relaxation oscillations. From this figure, τ is found to be 3.28 ns.
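
As a numerical check on the quoted lifetimes, the sketch below evaluates the relaxation oscillation frequency, assuming Eq. 1 has the standard four-level form ωr² = (P/Pth - 1) / (τ τf) and that ωr is an angular frequency. Both points are assumptions, as are the function name and the example pump ratios. The lifetimes are the values given in the text.

```python
import math

TAU_F = 240e-6  # fluorescence lifetime in seconds (from the text)
TAU = 3.28e-9   # photon lifetime in seconds (from the text)


def relaxation_frequency_hz(pump_ratio: float,
                            tau: float = TAU, tau_f: float = TAU_F) -> float:
    """Relaxation oscillation frequency in Hz for pump_ratio = P / P_th > 1."""
    if pump_ratio <= 1.0:
        raise ValueError("the laser must be pumped above threshold")
    omega_r = math.sqrt((pump_ratio - 1.0) / (tau * tau_f))  # rad/s
    return omega_r / (2.0 * math.pi)


if __name__ == "__main__":
    for ratio in (1.02, 1.1, 1.3):
        f_khz = relaxation_frequency_hz(ratio) / 1e3
        print(f"P/Pth = {ratio}: f_r = {f_khz:.1f} kHz")
```

Under these assumptions a pump 10% above threshold gives a frequency of a few tens of kHz, and the square-root dependence means that quadrupling the excess pump doubles the frequency.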

Stabilization  of  the  laser  diode.  At  pump  currents  up  to  2.5  times  the 
threshold  (Ith=30mA),  this  particular  laser  is  prone  to  mode  hopping.  At 
larger  currents,  mode  hopping  still  occurs  but  the  laser  operates  in  a  relatively 
stable  manner.  The  linewidth  of  the  laser  diode  can  be  reduced  and  mode 
hopping  prevented  by  effective  use  of  feedback  into  the  semiconductor  laser 
cavity.  This  is  accomplished  in  the  following  manner,  see  Fig.  1. 
Immediately  after  the  collimating  lens,  a  diffraction  grating,  DG,  is  placed  so 
that  the  pump  beam  intersects  the  surface  at  an  angle  of  approximately  45°. 
The  first  order  diffracted  beam  is  then  directed  towards  the  Nd:YAG  crystal. 
The  second  order  diffracted  beam  is  reflected  back  into  the  laser  cavity  by  a 
nearby  mirror,  M.  This  alignment  creates  a  compound  cavity  in  which  the 
external  cavity  modes  couple  with  the  internal  cavity  modes  producing  a 
more  stable  laser  diode. 

Heat  dissipation  and  thermal  lensing.  The  crystal  rests  lengthwise  in  a 
small  groove  drilled  in  the  aluminum  mount  and  is  covered  with  a  grooved 
cap.  It  is  clamped  into  place  from  the  top  using  a  small  plastic  tipped  set 
screw.  Thermal  contact  is  enhanced  around  the  entire  circumference  using 
thermally conductive jelly. Though the temperature is controlled from
below using a thermo-electric (TE) cooler, the high thermal conductivity of the
aluminum surrounding the crystal ensures that, for temperatures close to
room temperature, there will be a large degree of symmetry in the thermal
cooling around its outer periphery. This produces a radially symmetric index of
refraction profile, which is advantageous for compensation of thermal
lensing. At cooler temperatures deviations from the radial symmetry are
expected.

Mechanical stability. The Nd:YAG crystal is clamped into an
aluminum  block  that  is  cemented  to  a  thermo-electric  cooler  which  is 
cemented  to  a  large  aluminum  block.  This  piece  is  bolted  onto  a  rail.  The 
output  coupling  mirror  is  mounted  to  a  New  Focus  single  mode  fiber 
coupler.  This  coupler  is  designed  for  the  high  degree  of  stability  necessary  to 
launch  light  into  a  single  mode  fiber,  and  should  not  be  prone  to  mechanical 
vibrations  or  drift.  The  fiber  coupler  is  bolted  to  a  large  aluminum  block  that 
is  bolted  to  the  rail. 

For completeness, Table 1 summarizes properties of the Nd:YAG
crystal.

Table 1

Physical Properties of Nd:YAG laser crystal

Chemical formula             Nd:Y3Al5O12
Density of Nd ions           1.38 x 10^20 cm^-3
Emission cross section       4.1 x 10^-19 cm^2
Fluorescence lifetime        240 x 10^-6 s
Saturation intensity         2.0 kW cm^-2
Index of refraction, n       1.82
Linear thermal dispersion    9.86 x 10^-6
Heat conductivity            13 W m^-1 K^-1
Density                      4.55 g cm^-3
Specific heat, cp            607 J kg^-1 K^-1
Linear heat transfer         27.5 W m^-2 K^-1

Dynamical equations of motion

Two  nonlinearly  coupled  second  order  differential  equations  have 
been  used  to  successfully  describe  the  laser  {8}.  These  are 

E(t )  =  -^[G(0  -  <x{t)]E(t)  +  icoE(t)  +  4et]  (3) 

and 

G(0  =  ±[p-G(0(l  +  |£(0|2)].  (4) 

In  Eq.  3,  E  represents  the  electric  field,  t  is  the  photon  lifetime,  a  is  the 
internal  losses,  to  is  the  frequency  detuning  from  the  nearest  cavity  mode,  e  is 
the  spontaneous  emission  noise  strength,  and  q  is  a  Gaussian  random 

number.  In  Eq.  4,  G  is  the  gain,  tf  is  the  fluorescene  lifetime,  and  P  is  the 
pumping.  In  the  literature,  lasers  that  can  be  described  by  this  type  of 
equations  are  designated  "Class-B".  Without  the  noise  term,  these  equations 
will  predict  only  the  steady  state  operation  the  laser.  Chaotic  behavior  does 
not  occur  in  such  systems  due  to  the  fact  there  is  simply  not  enough  room  in 
two  dimensional  system  for  the  exponential  divergence  of  adjacent 
trajectories  along  with  the  folding  of  orbits  on  themselves.  In  order  to 
enlarge  the  phase  space,  the  pump  parameter  or  the  internal  losses  are  be 
modulated  and  the  pump  term  becomes: 

$$P = P_0 + P_1 \sin(\omega_d t) \qquad (5)$$

The result of Eq. 5 is to create a 2 1/2 dimensional system. In this driven
system nonlinear dynamics can take place. In this investigation, we
pump with a modulation frequency $\omega_d$ near the relaxation oscillation
frequency $\omega_r$: $0.2\,\omega_r < \omega_d < 2\,\omega_r$.
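
As an illustration of how the modulated pump enters the dynamics, the following is a minimal sketch of the noise-free rate equations written for the intensity I = |E|^2 (multiplying Eq. 3 by the complex conjugate of E removes the detuning term). All numerical values, the function names, and the choice of a fixed-step Runge-Kutta integrator are assumptions made for illustration only; they are not taken from the experiment.

```python
import math


def pump(t, p0, p1, omega_d):
    """Eq. 5: pump with sinusoidal modulation of depth p1 at angular frequency omega_d."""
    return p0 + p1 * math.sin(omega_d * t)


def derivatives(t, state, params):
    """Noise-free Eqs. 3-4 for the intensity I = |E|^2 and the gain G."""
    intensity, gain = state
    p0, p1, omega_d, alpha, tau, tau_f = params
    d_intensity = (2.0 / tau) * (gain - alpha) * intensity
    d_gain = (pump(t, p0, p1, omega_d) - gain * (1.0 + intensity)) / tau_f
    return (d_intensity, d_gain)


def rk4_step(t, state, dt, params):
    """One classical fourth-order Runge-Kutta step of size dt."""
    def shifted(base, slope, h):
        return tuple(b + h * s for b, s in zip(base, slope))

    k1 = derivatives(t, state, params)
    k2 = derivatives(t + 0.5 * dt, shifted(state, k1, 0.5 * dt), params)
    k3 = derivatives(t + 0.5 * dt, shifted(state, k2, 0.5 * dt), params)
    k4 = derivatives(t + dt, shifted(state, k3, dt), params)
    return tuple(
        s + dt / 6.0 * (a + 2.0 * b + 2.0 * c + d)
        for s, a, b, c, d in zip(state, k1, k2, k3, k4)
    )


def steady_state(p0, alpha):
    """Unmodulated fixed point above threshold: I = p0/alpha - 1, G = alpha."""
    return (p0 / alpha - 1.0, alpha)


def relaxation_frequency(p0, alpha, tau, tau_f):
    """Small-signal relaxation oscillation angular frequency (damping neglected)."""
    return math.sqrt(2.0 * (p0 - alpha) / (tau * tau_f))


# Hypothetical parameter values, chosen only for illustration.
ALPHA, TAU, TAU_F = 0.01, 1.0e-9, 240.0e-6
P0 = 0.012
OMEGA_R = relaxation_frequency(P0, ALPHA, TAU, TAU_F)
# Modulate at 10% depth, just below the relaxation oscillation frequency.
PARAMS = (P0, 0.1 * P0, 0.9 * OMEGA_R, ALPHA, TAU, TAU_F)
```

A trajectory would be obtained by calling `rk4_step` repeatedly from a state near `steady_state(P0, ALPHA)` with a time step that is a small fraction of the relaxation oscillation period; the noise term of Eq. 3 is deliberately left out of this sketch.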

Experimental  Results 

The laser is pumped very close to threshold in order to obtain the
single mode behavior described above. Without intentional pump modulation,
the lasing intensity is characterized by very low amplitude relaxation
oscillations that are typically between 20 kHz and 100 kHz depending on the
pump level. This undesirable trait can be traced to fluctuations in the laser
diode pump beam. Complete stabilization of the intensity, unfortunately,
was never obtained. Consequently, the results are tainted and the nonlinear
dynamics are obscured by the omnipresent noise. When the laser pump is
slightly modulated at frequencies in the vicinity of the relaxation oscillations,
the CW output of the Nd:YAG is replaced by large amplitude spiky pulses.
This is, in a sense, a form of Q-switching by gain pumping. Here the
nonlinear interplay between the electric field and the carrier inversion level
causes the amplitude of the pulses and the time spacing between them to be
chaotic. If the modulation frequency becomes too great, at least 1.5 times the
relaxation oscillation frequency, then the Nd:YAG laser is unable to respond
and the pulse train damps out. Hence the focus is on driving the laser between
0.2 and 2 times the relaxation oscillation frequency. Time series of the
intensity are shown in Fig. 3. In these figures, the modulation frequency,
$\omega_d$, is kept constant just below $\omega_r$, while the amplitude, $P_1$, is
different. In Fig. 3(a), the laser fluctuates chaotically. The amplitude of the
pulses varies extensively and, in addition, the intervals between pulses are
irregular. Though the time series appears aperiodic with no evident correlation
between successive pulses, this is not the case. In Fig. 4, a first return map
is constructed. It plots the (N+1)st peak versus the Nth peak. There is an arch
like structure to the plot indicative of a deterministic process. It is smeared
due to the insidious presence of noise. (The smearing may also be due in part to
the fact that the system may be high dimensional. In this case, the two
dimensional plot will have compressed the points so that neighboring points may
be false nearest neighbors {7}.) In contrast to the time series shown in Fig. 3,
for the time series of a stochastic process there would be no structure and the
return map would appear as a blob of points. Figure 3(b) is measured with a
higher level of pumping. We find an interesting phenomenon, coexisting
attractors. Here the laser is seen to be first lasing with large amplitude
pulses, but it then falls to a train of low amplitude pulses. In terms of the
dynamical equations, such a scenario occurs when multiple solution sets exist
for the given parameter space. The noiseless system follows one set or the other
depending on the initial conditions. In the experimental arrangement, the
inherent noise causes a switching from one state to the other. This will have
deleterious consequences if the desired result is a strongly gain switched laser.
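
The first return map described above can be built from a digitized intensity record in two steps: extract the pulse peaks, then pair each peak with its successor. The sketch below is one simple way to do this; the function names and the plain local-maximum peak detector are assumptions for illustration, and a noisy record would in practice need a threshold or smoothing before peak detection.

```python
def peak_values(samples):
    """Return the values of local maxima in a sampled time series.

    For a flat-topped pulse, only the first sample of the plateau is kept.
    """
    return [
        samples[i]
        for i in range(1, len(samples) - 1)
        if samples[i - 1] < samples[i] >= samples[i + 1]
    ]


def first_return_map(peaks):
    """Pair each peak with its successor: (N-th peak, (N+1)-st peak)."""
    return list(zip(peaks[:-1], peaks[1:]))
```

Plotting the second element of each pair against the first gives the return map; a deterministic process traces out a curve, whereas uncorrelated peaks fill the plane without structure.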

Conclusions 

In this investigation, a diode pumped Nd:YAG laser was specially
constructed and pump modulated in order to unleash its rich
nonlinear dynamical behavior. High quality time series were acquired using
a state of the art 3 MHz digitizer with 16-bit vertical resolution and a digital
signal processor. The laser exhibits chaotic behavior for a wide range of
modulation depths and modulation frequencies. While a particular
route into chaos was not verified, a period doubling bifurcation from P1 to P2
was recorded. The large amount of noise in the system may have prevented
higher bifurcations from being observed directly. In spite of this bifurcation, it
appears that the laser enters a chaotic state via a crisis. An important result is
the observation of coexisting attractors, i.e. two solutions of the
dynamical equations coexist. The system follows one solution or the other
depending on the initial conditions. Noise destabilizes the
solution, and the phase space trajectory may visit either solution set. In this
regime, the laser jumps between large and small amplitude spiky oscillations.
The importance of the result is that very small stabilizing signals may be
implemented in order to steer the trajectory from one attractor to the other;
in doing so, a binary communication sequence can be created.

Future investigations would likely involve employing an Argon laser
to pump the Nd:YAG crystal. This laser may be preferable to the diode laser
due to its low amount of noise, low responsivity to feedback, and circular
fundamental Gaussian beam profile. The focus remains on mapping the
route into chaos and then stabilizing existing unstable periodic orbits for
use as a communications channel.

In conclusion, though solid state lasers represent the oldest type of
these devices, their nonlinear dynamics have yet to be fully researched and
characterized. Thus, exciting new possibilities for their usage come into
being. Chaotic systems are characterized by (a) a wide range of behavior and
(b) an extreme dependence on initial conditions and on perturbations of a
system parameter. Hence a judiciously chosen steering signal {12} could be
implemented in order to choose what type of behavior the laser will follow.
This new form of control can lead to many diverse applications such as
secure communications.

Appendix  Data  Acquisition 

At the time of this writing, digitizing oscilloscopes are inherently
limited by either their sampling rate or the vertical resolution of the acquired
signal. Generally, the faster the sampling rate, the lower the resolution.
While the sampling rates may be as high as 2 gigasamples per second, the
signal is usually vertically resolved into $2^8$ (8-bit resolution) or $2^{10}$ (10-bit
resolution) intervals. This experimental constraint often comes as a
disconcerting surprise to theorists who commonly work with double precision
floating point values. To bridge this precision gap, we have
employed the combination of a 3 MHz 16-bit resolution digitizing card
(Bittware BAT-301) installed in a personal computer slot with a digital signal
processor (Bittware Omega DSP) as a data receiver. The sampling rate of the
digitizer is sufficient for successfully capturing time series from the Nd:YAG
laser. FIFO memory in the digitizer allows for the high speed capture of 1024
data samples. The arrangement is shown in Fig. 5.
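
To make the resolution comparison concrete, the short sketch below counts the quantization intervals of an ideal n-bit digitizer and the size of one interval relative to the input range. The normalized full-scale range of 1.0 and the function names are assumptions for illustration; real converters add noise and nonlinearity that this ignores.

```python
def quantization_levels(bits):
    """Number of intervals into which an ideal n-bit digitizer divides its range."""
    return 2 ** bits


def lsb_size(full_scale, bits):
    """Smallest resolvable step of an ideal digitizer spanning full_scale."""
    return full_scale / quantization_levels(bits)


if __name__ == "__main__":
    # Compare the 8-bit and 10-bit oscilloscope cases with the 16-bit card.
    for bits in (8, 10, 16):
        print(bits, quantization_levels(bits), lsb_size(1.0, bits))
```

On this idealized measure, the 16-bit card resolves the signal 256 times more finely than an 8-bit oscilloscope.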
The digitizer DSP combination must be programmed so that data is
properly transferred from one board to the other while the digitizer
continuously operates. To achieve this, after a predefined number of samples
are acquired, the digitizer issues a hardware interrupt. This interrupt,
designated IRQ3 by the DSP, signals the DSP that it should read the samples
into its data memory. Consequently the digitizer is programmed for the
sampling rate and the number of samples to acquire before an interrupt signal
is issued. In turn, the DSP must be programmed to process the incoming
request, write the samples to memory, and keep a tally of the total number of
samples. When the total number of samples has been obtained, they are written
to a file on the computer's hard disk.

As  the  DSP  card  is  constructed  to  be  Harvard  architecture,  the  address 
location  of  the  program  code  and  the  data  variables  must  be  specified  in  the 
physical  memory  banks.  This  is  accomplished  by  use  of  an  architecture  file 
whose  purpose  is  to  map  out  the  program  memory  and  the  data  memory.  If 
this  architecture  file  does  not  allocate  sufficient  resources  for  either  the 
program  code  or  the  data  variables,  then  the  C  source  code  will  not  compile. 
The  process  of  specifying  the  architecture  file  is  not  too  difficult,  but  problems 
did  arise  (and  were  not  successfully  resolved)  when  data  arrays  larger  than 
64K  words  were  desired. 

A second issue concerns synchronization between the two boards.
While the digitizer may go along acquiring data at a 3 MHz rate, the DSP
must process this data on a 30 MHz clock cycle. This means that the DSP
must acknowledge the interrupt, handle the interrupt overhead time,
retrieve a data sample, store it in memory, then reset the interrupt before the
subsequent interrupt is issued by the digitizer. If this process cannot be
completed in time, the digitizer's FIFOs wrap around and data is overwritten,
destroying the continuity of the time series. In practice the process works well
up to 1 MHz sampling speeds, but can break down above these rates. This is
due to the fact that naive implementation of the generic interrupt procedures
that are part of the DSP software library will result in unacceptably large
overhead times since they can demand between 35 and 120 CPU clock cycles.
However, in our implementation, we rewrote the interrupt handler software
so that an interrupt is handled in only a few clock cycles. This savings in time
allows data to be acquired at the full 3 MHz rate.
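
The timing constraint can be checked with simple arithmetic. The sketch below assumes, as a simplification, that one interrupt is serviced per sample and that the interrupt overhead is the only cost; the function names are hypothetical.

```python
DSP_CLOCK_HZ = 30.0e6  # DSP clock rate quoted in the text


def cycles_per_sample(dsp_clock_hz, sample_rate_hz):
    """DSP clock cycles available between successive samples."""
    return dsp_clock_hz / sample_rate_hz


def max_sample_rate(dsp_clock_hz, cycles_needed):
    """Highest sampling rate that leaves cycles_needed clock cycles per sample."""
    return dsp_clock_hz / cycles_needed


if __name__ == "__main__":
    # Budget at the full digitizer rate.
    print(cycles_per_sample(DSP_CLOCK_HZ, 3.0e6))
    # Rates sustainable with the library handler's best and worst overhead.
    print(max_sample_rate(DSP_CLOCK_HZ, 35), max_sample_rate(DSP_CLOCK_HZ, 120))
```

Under these assumptions only 10 clock cycles are available per sample at 3 MHz, while a 35-cycle handler cannot keep up much beyond about 0.86 MHz, which is in line with the breakdown observed above 1 MHz and with the need for a handler that takes only a few cycles.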

The  Omega  DSP  also  possesses  a  pair  of  digital-to-analog  converters 
(DAC).  This  allows  us  the  opportunity  to  acquire  data,  analyze  it,  then  issue  a 
controlling  signal.  This  feature  can  be  used  to  control  a  chaotic  trajectory  or 
apply  steering  and  communications  signals  to  a  chaotic  carrier  wave. 
PREPARATORY WORK TOWARDS A COMPUTER SIMULATION OF ELECTRON
BEAM OPERATIONS ON TSS 1


Michael  J.  Pangia 
Associate  Professor 
Department  of  Geology  &  Physics 
Georgia  Southwestern  State  University 


Abstract 

This work is the first stage towards a Particle-In-Cell simulation that will be used to
explain the low kHz frequency electron modulations observed by the Space Particle
Correlator Experiment (SPACE) during beam operations on the Tethered Satellite System
(TSS 1). The approach is to work in the frame of reference of the orbiter with the ambient
plasma flowing past it. A 2-D (two position and two velocity components) treatment is
chosen based on the geometry of the problem and computer limitations. The main emphasis
of this work is to test that the free-flowing plasma is stable before including the effects of a
beam.
FURTHER ANALYSIS OF KILOHERTZ ORDER WAVES ASSOCIATED WITH
ELECTRON BEAM OPERATIONS ON STS 46

Michael  J.  Pangia 


I. Introduction

Strong electron modulations in the 0.5 to 2.0 kHz frequency range observed by the Space
Particle Correlator Experiment (SPACE) during beam operations on the Tethered Satellite System
(TSS 1) have been identified by Gough et al. [1997]. The purpose of this research is to begin work
on a Particle-In-Cell (PIC) simulation of beam operations to determine the mechanisms
responsible for generating the observed plasma waves. Progress was made in testing the stability
of algorithms when just the free-flowing plasma is included, i.e., without any interaction with
the beam taking place.

II.  Basics  of  Model 

Figure  1  shows  the  basic  simulation  geometry  that  was  chosen.  Its  features  are: 

(1) The frame of reference is that of the orbiter.

(2) It is two dimensional (two position and two velocity components).

The magnetic field is perpendicular to Figure 1, so that the beam region is cylindrical
with the electrons gyrating around the field lines. It is believed that the major effect of
the electron beam is to heat the plasma. However, the problem would have to be treated in
3-D to correctly account for the heating process. To simplify the problem so that a two
dimensional treatment may be used, the heating process is going to be modeled using some,
as of yet undecided, algorithm.


Simulation  Box 
Fig.  1 
Although the electrons act like a fully magnetized fluid and treating the electrons as a
fluid would be computationally efficient, the equations of state and transport coefficients for both
the ambient and beam electrons are unknown. Therefore, it was decided to treat both the
electrons and the ions as particles using explicit time stepping.

III. Stability of Free-Streaming Plasma

The first step is to ensure that the free-streaming plasma is stable before adding a beam.
A question arose as to how to handle particles that leave the simulation box. The geometry of
Figure 1 permits the use of periodic boundary conditions at the top and bottom boundaries (the
Y-limits of the simulation box). However, since the plasma would be slowing down in the presence
of a beam, periodic conditions do not apply at the left and right boundaries (the X-limits of the
simulation box).

The boundary condition chosen for the electric field is to set the electric potential to zero
at the left and right boundaries. As for the particles, simply deleting particles that exit either the
left or right boundary of the simulation box would result in an instability, because particles that
would eventually gyrate back into the simulation box after having exited the left or right
boundary would be deleted. The remedy was to do two things. (1) Initially add particles
in the regions to the left and right sides of the simulation region having the same initial thermal
and drift speeds as the simulation region. These regions act as buffers that allow particles to exit
the simulation box and return at some later time step. (2) Use the criterion that particles are
removed from the simulation only if they are several grid steps outside of the left or right
boundary. Particles that are in the buffers are allowed to continue gyrating and drifting, but do
not experience any electric field.
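
The two-part remedy amounts to sorting each particle into one of three regions by its X position. A minimal sketch follows; the box limits and buffer width are the test-case values quoted later in this section (in electron Debye lengths), while the function names and the assumption that the removal threshold coincides with the outer edge of the buffer are illustrative choices, not details given in the text.

```python
BOX_MIN, BOX_MAX = 0.0, 20.0  # X-limits of the simulation box (Debye lengths)
BUFFER_WIDTH = 5.0            # assumed removal distance, equal to the buffer length


def classify(x, box_min=BOX_MIN, box_max=BOX_MAX, buffer_width=BUFFER_WIDTH):
    """Return 'box', 'buffer', or 'remove' for a particle at position x."""
    if box_min <= x <= box_max:
        return "box"
    if box_min - buffer_width <= x <= box_max + buffer_width:
        return "buffer"
    return "remove"


def feels_electric_field(x):
    """Only particles inside the box are pushed by the electric field."""
    return classify(x) == "box"
```

In a particle push loop, particles classified as `buffer` would still be advanced with the magnetic gyration and drift, with the electric force set to zero, and those classified as `remove` would be dropped from the particle list.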
A test case was run using 10,000 particles of both species uniformly filling the simulation
box. Other parameters are listed in the following table, with $f_e$ and $f_c$ being the electron
plasma and cyclotron frequencies, $m_a$ and $v_a$ being the mass and thermal speed for species
$a$ ($a = i, e$), and $V_d$ being the drift speed of the plasma. The mass ratio is set artificially
high as a simplification.

Parameter     Value
f_e/f_c       1
m_i/m_e       10000
v_i/v_e       0.003
V_d/v_e       0.01

The initial phase space plot $V_y$ vs. $X$ proved to be useful in predicting stability. Figure
2 shows the initial electron configuration. The $X$ values are in electron Debye lengths, $\lambda_e$,
and $V_y$ values are in terms of $v_e$. The simulation box extends from $X = 0$ to $X = 20\lambda_e$.
Notice that electrons are permitted to extend outside of the box. (The length of each buffer was
$5\lambda_e$.) The skewness in the phase plot is due to placing the electrons at their actual position
once their guiding center and velocity components were randomly chosen. By looking at Figure 2
one can say that the size of the buffers must be at least as large as was chosen; otherwise the
simulation box would have a shortage of electrons of large positive $V_y$ at $X = 0$ (of large
negative $V_y$ at $X = 20\lambda_e$), which would lead to an artificial instability.
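
The skew in the $V_y$ vs. $X$ plot follows directly from the loading step. A minimal sketch is given below, assuming positions in Debye lengths, velocities in units of the electron thermal speed, a Debye length defined as the thermal speed divided by the angular plasma frequency, and a sign convention in which an electron sits at its guiding center plus $V_y$ divided by the cyclotron frequency. The function name, the Gaussian velocity sampling, and these conventions are assumptions; the convention was chosen because it reproduces the shortage of positive $V_y$ at $X = 0$ described above.

```python
import random


def load_electrons(n, gc_min, gc_max, fe_over_fc=1.0, seed=0):
    """Place n electrons given random guiding centers and thermal velocities.

    Returns tuples (x, v_x, v_y, x_gc). In these units the gyroradius of an
    electron moving at speed v is fe_over_fc * v.
    """
    rng = random.Random(seed)
    particles = []
    for _ in range(n):
        x_gc = rng.uniform(gc_min, gc_max)  # guiding center, uniform in X
        v_x = rng.gauss(0.0, 1.0)           # thermal velocity components
        v_y = rng.gauss(0.0, 1.0)
        x = x_gc + fe_over_fc * v_y         # actual position (assumed sign)
        particles.append((x, v_x, v_y, x_gc))
    return particles
```

Loading guiding centers over the box plus both buffers, for example `load_electrons(10000, -5.0, 25.0)`, lets electrons with guiding centers just outside the box contribute the large positive $V_y$ population at $X = 0$ and the large negative one at the opposite edge.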
Phase Space at t=0.


Figures 3 and 4 show contours of electric potential (in dimensionless form $e\phi/kT_e$)
at $t = 0$ and $\omega_e t = 200$, where $\omega_e$ is the electron angular frequency. Although the
background level did grow over the time period, it was basically stable. It is believed that the
number of particles can be increased and the grid spacing can be decreased to such an extent so
as to reduce the noise to an acceptable level when the simulation is run on a faster computer.
This will occur after having fully tested the simulation.


Phi Contours at t=0.


It  is  believed  that  the  basic  model  of  free-streaming  plasma  is  stable.  This  represents 
completion  of  the  first  stage  towards  modelling  the  beam  operations  on  TSS  1.  The  next  stage 
would  be  inclusion  of  a  heating  region  to  model  the  effect  of  the  beam. 
Phi Contours at t=200


Fig.  4 
MODELING OF IONOSPHERIC CONVECTION
FROM THE IMF AND SOLAR WIND DATA


Vladimir  O.  Papitashvili 
Associate  Research  Scientist 
Space  Physics  Research  Laboratory 
The  University  of  Michigan 

Abstract 

It has been demonstrated in many studies that the large-scale, quasi-steady ionospheric
convection at high latitudes is primarily controlled by the interplanetary magnetic field (IMF) and
solar wind (SW) parameters. In this study we applied a linear regression analysis technique to the
Defense Meteorological Satellite Program (DMSP) electrostatic potential database available at
Phillips Laboratory/Geophysics (PL/GPS). We binned satellite data by every 1° of the corrected
geomagnetic (invariant) latitude and 0.5 hour of magnetic local time (MLT). Then the regression
analysis was applied to the data in each bin in a similar way as was done for construction
of the IZMIRAN Electrodynamic Model (IZMEM) from ground magnetometer data. This
allowed us to calibrate the IZMEM model against satellite data for a future attempt at constructing
the DMSP electric potential model for prediction of the ionospheric convection over both the
Northern and Southern polar regions solely from the IMF/SW parameters measured near the
Earth's orbit.
MODELING OF IONOSPHERIC CONVECTION
FROM THE IMF AND SOLAR WIND DATA

Vladimir  O.  Papitashvili 

Introduction 

The understanding of complex interactions that couple the solar wind (SW) plasma, the
interplanetary magnetic field (IMF), and the Earth's magnetosphere and ionosphere is of
fundamental interest to solar-terrestrial physics. In order to integrate what has been learned
through a number of national and international disciplinary-oriented research programs for the
enhancement of prediction technology, the U.S. National Science Foundation (in cooperation with
NOAA, NASA, and DoD) announced a challenge to the space physics community through the
establishment of the National Space Weather Program [Maynard, 1995]. The program will be
responsible for the development of a science-based prediction technology which could help
society to reduce economic losses due to sudden disturbances on the Sun, related geomagnetic
storms, and the resulting "blackouts" in electric power, communication, navigation,
technological systems, etc.

It has been found in numerous studies that changes in the strength and direction of the IMF
near the Earth's orbit are important factors in causing the geomagnetic activity that can greatly
perturb the magnetosphere and ionosphere. The coupled solar wind-magnetosphere-ionosphere
system is most highly energized when the IMF turns south and becomes anti-parallel to the
direction of the Earth's magnetic field. A variety of mechanisms are involved in the energy transfer
down to the ionosphere and thermosphere. The geomagnetic disturbances, particle precipitation
and aurora in the polar caps are generally controlled by the directly driven processes because the
interplanetary electric field $\mathbf{E} = -[\mathbf{V}_{SW} \times \mathbf{B}_{IMF}]$ is continuously imparted to the
magnetosphere. The disturbances in the auroral zone, middle and low latitudes (i.e., on a global
scale) are mainly controlled by the loading-unloading processes when energy is initially stored in
the magnetospheric tail and then released in the form of magnetic storms and substorms. The
neutral atmosphere circulation is affected significantly by the global magnetic storms; at the same
time, the neutral wind "fly-wheel" effect causes a significant post-storm ionospheric ion convection.

There has been a steady evolution in the production of maps of the ionospheric electric
potential or plasma convection patterns that are associated with various orientations of the IMF.
These maps have consisted of both empirical and theoretical models [e.g., Heppner, 1972; Heelis
et al., 1982; Foster, 1983; Heppner and Maynard, 1987; Rich and Maynard, 1989; Hairston and
Heelis, 1990; Papitashvili et al., 1994; Rich and Hairston, 1994; Weimer, 1995; Ruohoniemi and
Greenwald, 1996]. More often than not, the maps consist of pictorial sketches; while useful for
the knowledge that they provide, the lack of a flexible or portable, numerical representation may
limit the maps' utility.

[Figure 1 panels: NOWCAST 96-07-03 16:37 UT; FORECAST 96-07-03 17:15 UT. Time that was
submitted for forecasting: July 03 1996 16:37 UT. Position of WIND (in Re): X = 211.6, Y = 19.5,
Z = -10.7. Time delay is 00 hour(s) and 55 minute(s).]

Figure 1. An example of the ionospheric convection specification and forecast obtained from the
IZMEM model (http://www.sprl.umich.edu/space weather).

For operational purposes, it is desirable to be able to measure continuously the IMF
parameters as well as the solar wind velocity and density near the Earth's orbit at
the first Lagrangian point (L1, approximately 200 Earth radii upstream). Having such
observations (for example, from NASA's WIND and/or ACE spacecraft), we can imagine a
predictive scheme of the space weather system based on coupled theoretical models that describe
the resulting flow of mass, energy, and momentum through the solar-terrestrial system. Figure 1
shows results from the on-going development of such a forecasting prototype.
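
The lead time available to such a forecast can be estimated from the upstream distance of the monitoring spacecraft. The sketch below assumes simple ballistic propagation of the solar wind along the Sun-Earth line, which is a simplification; the function name and the speed used in the example are hypothetical, with the speed chosen only so that the result reproduces the 55 minute delay listed with Figure 1 for WIND at X = 211.6 Re.

```python
EARTH_RADIUS_KM = 6371.0  # mean Earth radius


def propagation_delay_minutes(x_re, speed_km_s):
    """Travel time, in minutes, from x_re Earth radii upstream at a constant speed."""
    return x_re * EARTH_RADIUS_KM / speed_km_s / 60.0


if __name__ == "__main__":
    # WIND position from Figure 1; 408.5 km/s is an assumed solar wind speed.
    print(propagation_delay_minutes(211.6, 408.5))
```

For a fixed upstream distance, the delay scales inversely with the solar wind speed, so faster streams shorten the usable forecast interval.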


Brief  description  of  the  IZMEM  model 

Specifically, from the ground magnetometer data we developed a semi-empirical linear
regression model of the solar wind-magnetosphere-ionosphere interaction parameterized by the
IMF strength and direction: the IZMIRAN Electrodynamic Model, IZMEM [Feldstein and
Levitin, 1986; Papitashvili et al., 1994, 1995]. We postulate that the solar wind-magnetosphere-
ionosphere coupling can be considered, in a first approximation, as a linear "black box". This
"box" accepts changes in the IMF and solar wind plasma parameters ($B_x$, $B_y$, $B_z$, velocity $V$,
and density $n$) as an input signal, and then induces ground-based geomagnetic perturbations as
the output signal. This approach has already been used in other studies, in particular, those
employing the linear prediction analysis [e.g., Clauer, 1986, and references therein].

In this linear model the regression coefficients relate any ground geomagnetic field
component, for example, $H$, to changes in the corresponding IMF parameter [Levitin et al., 1982;
Papitashvili et al., 1990, 1994]:

$$H^i(\theta, \phi) = K^i_{HB_x} B_x + K^i_{HB_y} B_y + K^i_{HB_z} B_z + H^i_0 \qquad (1)$$

Equation (1) shows only the relationships with the IMF, but its free term can be
expanded for the solar wind parameters:

$$H^i_0 = K^i_{HB_x} B_x + K^i_{HV} V^2 + K^i_{Hn}\, n V^2 + H^i_{00} \qquad (2)$$

Here $K^i_{Hxx}$ are regression coefficients for $i = 1, \ldots, 24$, where $i$ is the universal time
(UT) hour and $H^i_0$ is a residual part of (1) for average solar wind parameters (e.g., $n = 4$
cm$^{-3}$, $V = 450$ km/s). $H^i_{00}$ represents the geomagnetic variations that are free from the
IMF and solar wind impact.
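
A relation of this kind is an ordinary linear regression. As a minimal sketch, the code below fits a single regressor (one IMF component) with a free term and also returns the correlation coefficient; the model itself uses several regressors at once, so this one-variable version and the function name are simplifying assumptions for illustration.

```python
import math


def linear_fit(x, y):
    """Least-squares fit y = slope * x + intercept, with Pearson correlation r."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    syy = sum((yi - mean_y) ** 2 for yi in y)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    r = sxy / math.sqrt(sxx * syy)
    return slope, intercept, r
```

Here the slope plays the role of a regression coefficient and the intercept plays the role of the free term; the function expects at least two distinct x values and a y series that is not constant.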

Developing this model, we utilized the hourly mean, total values of the IMF and ground
geomagnetic data for each season of the year (summer, winter, and equinox). Data from both the
Northern and Southern polar regions above $\Phi_{CGM} = \pm 57°$ latitude have been subjected to the
multi-regression analyses. Then, utilizing the numerical inversion technique [Kamide et al.,
1981; see also Papitashvili et al., 1994], we can obtain the distribution of ionospheric
electrostatic potentials over the entire polar regions as a superposition of the IMF related terms
accounting for different interplanetary conditions (e.g., positive and negative $B_z$ and $B_y$,
various $V$ and $n$). A number of ionospheric electrodynamic parameters (such as current function,
electric potentials, Joule heating, magnetic disturbances, electric fields, ionospheric, Hall,
Pedersen or field-aligned currents) can rapidly be computed from the IZMEM model and mapped over
both the Northern and Southern polar regions. Even the ground-induced currents (GIC, important
for the electric power companies) can successfully be modeled for given IMF conditions [e.g.,
Walker and Papitashvili, 1994].
Figure 2. The IZMEM output in geographic coordinates: A comparison of the modeled
ion convection velocities at 300-km altitude with the SuperDARN observations.

Figure 2 shows an example where the ionospheric electric fields modeled by the IZMEM are
converted into the ion drift velocities at 300 km altitude (top panel) and mapped in geographic
coordinates for the points collocated with the SuperDARN radar observations (lower panel)
[Kustov et al., 1997]. One can see that both sets of velocity vectors exhibit the same trends,
namely the westward flow at lower latitudes and the eastward flow at higher latitudes. Certainly,
the model shows smoother flows, but in general the prediction and measurements agree
well. The IZMEM model can be accessed via the Web (http://www.sprl.umich.edu/MIST).

Thus, we conclude that the IZMEM model is able to provide structural elements of the
high-latitude ionospheric convection; that is, different terms in Eqs. (1) and (2) represent
corresponding contributions (elements) from different sources to the total convection pattern.
In this study, a similar approach was applied for the first time to the DMSP electrostatic
potential data.


Methodology 

The research undertaken at the Phillips Laboratory was two-fold. First, we comprehensively
studied the DMSP ion driftmeter database (total 6.8 Gigabytes) sorting four years of data (1993-
1996) from 5 satellites (F08 - 1.0 GB, F10 - 1.1 GB, F11 - 1.6 GB, F12 - 1.8 GB, and F13 - 1.3
GB) according to the IMF, corrected magnetic (invariant) latitude and magnetic local time
(MLT). According to Hairston and Heelis [1993], the DMSP ion drift data have already been
averaged every 4 seconds during the database construction. Every record in the data sets
consists of the time mark, horizontal and vertical ion drift velocities, geographic and geomagnetic
coordinates, and electrostatic potential calculated along the satellite track over the high latitudes
(the latter is corrected for the corotation potential).

The DMSP ion drift database has been sorted and reduced to a total of 460 Mbytes of data
measured above 40° invariant latitude in both the Northern and Southern Hemispheres and
matched with the available IMF records (hourly values, OMNI database obtained from
NASA's National Space Science Data Center). The IMF data for the current hour were used for the
matching if the DMSP record had a time mark between 30:00 and 59:59 minutes of each UT
hour; the previous IMF UT hour was matched with the DMSP records taken between 00:00 and
29:59 minutes of each hour. This allows us to suppose that the DMSP measurements were
generally affected by the averaged IMF conditions that existed during the previous ~30 or more
minutes.
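
The matching rule can be stated compactly in code. The sketch below returns the start of the hourly IMF record to pair with a DMSP time mark; the function name and the use of Python `datetime` objects for the time marks are assumptions for illustration.

```python
from datetime import datetime, timedelta


def matched_imf_hour(dmsp_time):
    """Start of the hourly IMF record matched to a DMSP time mark.

    Records in the second half of a UT hour use the current hour's IMF value;
    records in the first half use the previous hour's value.
    """
    hour_start = dmsp_time.replace(minute=0, second=0, microsecond=0)
    if dmsp_time.minute >= 30:
        return hour_start
    return hour_start - timedelta(hours=1)
```

For example, a record stamped 16:37 UT is paired with the 16:00 hourly value, while one stamped 16:10 UT is paired with the 15:00 value.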

Then we sorted all of these records according to the season of the year (northern
summer and southern winter: May-August; northern winter and southern summer: November-
February; equinox: March, April, September, and October) and certain IMF conditions:
(1) $B_z < 0$ for all $B_y$, (2) $B_z > 0$ for all $B_y$, (3) $B_y < 0$ for $B_z < 0$, (4) $B_y > 0$ for
$B_z < 0$, (5) $B_y < 0$ for $B_z > 0$, and (6) $B_y > 0$ for $B_z > 0$. For example, a total of 1.7
million records were processed for all seasons of the year only to satisfy the 1st condition.

For each of the above-mentioned conditions, the DMSP electrostatic potential data were
binned in every 1° of invariant latitude (from 49.5° to 90°, centering on 50°, 51°, etc.) and every
0.5 hr of magnetic local time (00:00-00:29, 00:30-00:59, etc.). The last latitude bin extends from
88.5° to 90°. If a few consecutive 4-sec DMSP data records fell in the same bin, the electrostatic
potential was averaged over these records. This improved the reliability of the statistics, though
it reduced the number of points in each bin. After this reduction, the number of points in each bin
varies from a few to a few hundred (with a maximum of ~1500 in certain bins). As expected, the
best coverage was achieved for the dawn-dusk local times; poorer coverage (even zero coverage)
is obtained for the noon-midnight regions at lower latitudes.
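
The binning and averaging described above can be sketched as follows. This is only an illustration under stated assumptions: the record layout (invariant latitude, MLT in hours, potential in kV) and the function names `bin_index` and `average_consecutive` are hypothetical and are not taken from the original processing software.

```python
from itertools import groupby
from math import floor

N_LAT_BINS = 40   # 1-degree bins centered on 50..89 deg (last one spans 88.5-90)
N_MLT_BINS = 48   # half-hour bins over 24 h of magnetic local time

def bin_index(inv_lat_deg, mlt_hours):
    """Return (lat_bin, mlt_bin), or None outside 49.5-90 deg invariant latitude."""
    if inv_lat_deg < 49.5 or inv_lat_deg > 90.0:
        return None
    # Bin 0 covers 49.5-50.5 deg; everything above 88.5 deg goes to the last bin.
    lat_bin = min(int(floor(inv_lat_deg - 49.5)), N_LAT_BINS - 1)
    mlt_bin = int(floor((mlt_hours % 24.0) / 0.5))
    return lat_bin, mlt_bin

def average_consecutive(records):
    """Average the potential over runs of consecutive records in the same bin.

    `records` is an iterable of (inv_lat_deg, mlt_hours, potential_kv) tuples
    in time order (an assumed layout); the result is a list of
    (bin, mean_potential_kv) pairs.
    """
    tagged = [(bin_index(lat, mlt), pot) for lat, mlt, pot in records]
    averaged = []
    for key, run in groupby(tagged, key=lambda item: item[0]):
        if key is None:          # outside the binned latitude range
            continue
        values = [pot for _, pot in run]
        averaged.append((key, sum(values) / len(values)))
    return averaged
```

With 40 latitude bins and 48 MLT bins this gives the 40 x 48 = 1920 bins used in the analysis.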

Because such data processing consumes a lot of CPU time, we were able to process
fully only the data that satisfied the 1st condition. According to Eq. (1), we applied the linear
regression technique to the data in each bin (if the number of points in the bin was less than 10,
the bin was skipped). Because of significant scatter in the DMSP potential data falling in the same
bin for a given IMF condition, we applied two selection criteria to the results of the regression analyses.
All analyses where the correlation coefficient between the DMSP potentials and the IMF Bz < 0
data was less than 0.3 were excluded from consideration. Also, calculating the background
(quasi-viscous) potential from the free term in Eq. (1), we discarded the bins where the background
potential exceeded ±35 kV. In most cases, the number of "good" bins was about 50% of the
total number of bins: 40 x 48 = 1920. To ensure that the residual terms and regression
coefficients for Bz (-1 nT) in Eq. (1) obtained from the DMSP data really represent the background
potentials and the responses to a changing IMF, we compared the IZMEM and DMSP potential values
in each bin where the DMSP data analyses were available. This allows us to calibrate the IZMEM
model against the DMSP potentials obtained for the 1st IMF condition.
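
A minimal sketch of the per-bin regression and the two selection criteria is given below. It assumes that Eq. (1) reduces, for the 1st condition, to a straight line in Bz with a free (background) term; the function name `fit_bin`, its parameters, and the use of the absolute value of the correlation coefficient are assumptions made for illustration.

```python
import numpy as np

def fit_bin(bz_nt, potential_kv, min_points=10, min_corr=0.3,
            max_background_kv=35.0):
    """Fit potential = background + slope * Bz for one bin.

    Returns None when the bin is skipped (fewer than `min_points` records)
    or rejected by either selection criterion.
    """
    bz = np.asarray(bz_nt, dtype=float)
    pot = np.asarray(potential_kv, dtype=float)
    if bz.size < min_points:
        return None
    slope, intercept = np.polyfit(bz, pot, 1)   # degree-1 least-squares fit
    corr = np.corrcoef(bz, pot)[0, 1]
    # Criterion 1: weak correlation; criterion 2: background beyond +/-35 kV.
    if abs(corr) < min_corr or abs(intercept) > max_background_kv:
        return None
    return {"background_kv": intercept, "kv_per_nt": slope, "corr": corr}
```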

Results 

Figure 3 shows the correlation between the DMSP and IZMEM background potentials
and the regression coefficients for the Bz (-1 nT step) for the northern summer (only points
that satisfied the selection criteria are shown). With the equation

y = a + bx (3)

we can compare and evaluate these two models. Here y stands for the DMSP potentials obtained
as a result of the regression analyses with the IMF data, x stands for the original IZMEM
potentials, a is an offset between the two model data sets, and b is a calibration factor which can be
applied to the IZMEM potentials [see below]. It is obvious that even with such strict selection
criteria, a certain number of outliers still survive in the data set.


Figure 3. Correlation between the DMSP and IZMEM potentials for northern summer (all
data). Panels: background potentials; Bz (-1 nT) regression coefficients.

The DMSP potential outliers generally group near zero IZMEM potentials. Because the
latter mainly represent the boundary condition at ±57° invariant latitude where the
magnetospheric potential is zero, we suppose that the DMSP potentials in the original data sets have not
been fully corrected for the corotation potential.


Figure 4. Correlation between the DMSP and IZMEM potentials for the northern (left panels) and
southern (right panels) summers (here the DMSP data are cleaned of outliers). Panels: background
potentials; Bz (-1 nT) regression coefficients. Horizontal axes: IZMEM potential, kV.

Figure 4 (left panels) shows the same data as Figure 3, but now the DMSP potentials are
cleaned of outliers by applying the ±1.5σ criterion (σ is the standard deviation obtained for all
data in each panel of Figure 3). One can see that this improved the correlation between the two data
sets. Figures 4-5 show plots of the cleaned data for all seasons of the year and for
both the Northern and Southern polar regions.

Table 1 summarizes the correlation parameters between the DMSP and IZMEM potentials and
compares them with the calibration factors obtained earlier for IZMEM by Papitashvili et al.
[1994]. Note that the correlation coefficients for both all data (k0) and the cleaned data (kc) are
presented in Table 1.

Figure 5. Correlation between the DMSP and IZMEM potentials for the northern (left panels) and
southern (right panels) equinoxes (two top panels) and winters (two lower panels). Each panel plots
DMSP potential, kV, against IZMEM potential, kV, for the background potentials or the Bz (-1 nT)
regression coefficients.

The number of outliers ignored in the second round of correlation analyses by applying the
±1.5σ criterion was only about 10-15% of the total number of points in the analysis. Figures 4-5
show clearly that the DMSP potentials generally have the same trends as the corresponding
IZMEM potentials. Better correlation between the two data sets is achieved for the summer and
equinox seasons in both hemispheres. The winter potentials correlate better in the Northern
Hemisphere; the southern winter potentials correlate only moderately.
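
The two rounds of correlation analysis can be sketched as below. This is a hedged illustration, not the original procedure: it assumes that σ is the standard deviation of the residuals about the first straight-line fit (the text does not state exactly how σ was formed), and the name `calibrate` and the returned keys are hypothetical.

```python
import numpy as np

def calibrate(izmem_kv, dmsp_kv, n_sigma=1.5):
    """Fit DMSP = offset + factor * IZMEM before and after outlier removal."""
    x = np.asarray(izmem_kv, dtype=float)
    y = np.asarray(dmsp_kv, dtype=float)

    # First round: all data.
    factor, offset = np.polyfit(x, y, 1)
    k_all = np.corrcoef(x, y)[0, 1]

    # Assumed outlier rule: drop points whose residual exceeds n_sigma * sigma.
    residuals = y - (offset + factor * x)
    keep = np.abs(residuals) <= n_sigma * residuals.std()

    # Second round: cleaned data only.
    factor_c, offset_c = np.polyfit(x[keep], y[keep], 1)
    k_clean = np.corrcoef(x[keep], y[keep])[0, 1]
    return {"k_all": k_all, "k_clean": k_clean, "offset": offset_c,
            "factor": factor_c, "n_kept": int(keep.sum())}
```

Here `k_all` and `k_clean` play the role of the correlation coefficients for all data and for the cleaned data, respectively.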

The coefficients b in Table 1 can be compared with the amplification factors f obtained by
Papitashvili et al. [1994] solely from the assumption that the background (viscous) cross-polar
potential drop should not exceed ~35 kV [Reiff et al., 1981]. The latter value was revised in a
number of studies [e.g., Sergeev and Kuznetsov, 1981; Mozer, 1984; Reiff and Luhmann, 1986;
de la Beaujardiere et al., 1987; Papitashvili and Papitashvili, 1996]. It was suggested that this
potential might be as low as 6-10 kV; however, the accepted maximum values are between 15
and 25 kV.


Table 1.

                     Background potentials        Bz (-1 nT) coefficients      IZMEM
Season   Hemisphere  k0     kc     a       b      k0     kc     a       b      f
Summer   North       0.71   0.89    0.01   3.29   0.60   0.85   -0.99   2.51   3.2
         South       0.79   0.89   -5.19   2.58   0.80   0.91   -1.44   3.77   3.0
Equinox  North       0.79   0.89    0.36   1.82   0.71   0.86   -2.92   1.47   2.3
         South       0.67   0.80   -4.00   4.07   0.69   0.81   -3.02   2.25   2.7
Winter   North       0.69   0.81   -5.96   1.24   0.65   0.86   -3.86   1.51   1.5
         South       0.32   0.51   -1.36   1.77   0.41   0.55   -4.81   1.81   2.5

As seen from Table 1, most of the coefficients b for the background potentials are of the same
order as the IZMEM amplification factors f. Smaller b are obtained for the Bz (-1 nT) regression
coefficients than for the background potentials. This suggests that the two interaction mechanisms
(quasi-viscous and reconnection) behave differently in producing the ionospheric responses to changing
IMF and solar wind conditions. We conclude that these new coefficients a and b can now be
utilized for re-calibration of the original IZMEM model [see the discussion of the need for such a
calibration in Papitashvili et al., 1994].

Table 2 shows the maximum cross-polar potentials estimated from the DMSP data (Figures 4-5)
in comparison with the earlier calibrated IZMEM model. One can see that the DMSP potentials are
comparable with the earlier estimates; the average potentials in the two models differ by only 3.6 kV,
that is, well within the standard deviation of both data sets. The corresponding difference for the
Bz (-1 nT) regression coefficients is 1.8 kV, which is also within the standard deviation.


Table 2.

                      Background potentials, kV    Bz (-1 nT) coefficients, kV
Season   Hemisphere   IZMEM        DMSP            IZMEM        DMSP
Summer   Northern     30.4         29.5            14.1         11.4
         Southern     37.8         35.4            10.5         12.9
Equinox  Northern     41.4         33.5            15.4         10.8
         Southern     30.5         35.8            13.0         11.0
Winter   Northern     40.5         34.1            13.0         12.8
         Southern     40.5         31.0            15.6         11.6
Average               36.8 ± 5.1   33.2 ± 5.3      13.6 ± 1.9   11.8 ± 2.0

Conclusions 

Two major conclusions can be drawn from the presented analyses. First, the cross-polar
potential drop estimated from the DMSP data is of the order of ~33 kV. This result provides strong
experimental confirmation that the background (viscous) potential is considerably larger than the
currently accepted values (e.g., a maximum of ~15-20 kV), though it requires further confirmation for
northward IMF conditions. Second, the DMSP data show that the ionospheric potential
response to a 1-nT step in the southward IMF is ~12 kV. This number is close to the earlier
results obtained for the IZMEM model.

We have to conclude this study with the presented results, though the DMSP data have been
processed far enough to undertake similar regression analyses for the northward IMF conditions,
as well as to estimate the responses to a changing azimuthal component of the IMF. We
recognize that the duration of our research term (8 weeks) under the auspices of the Air Force Summer
Faculty Research Program was simply not enough for this hugely time-consuming computational effort.
This prevented us from accomplishing the entire study at once and from attempting the construction of a
fully operational model of the DMSP potentials parameterized by the IMF and solar wind
conditions. We hope to find an opportunity to continue this interesting study in the near future.

Abstract

By analyzing the wavelet averages of thermal dissipation in the troposphere measured
by the Argus platform, it is possible to determine regions where Kolmogorov's
universal equilibrium hypotheses are valid. Not surprisingly, these regions of the flow
exhibit spectra obeying the predicted $k^{-5/3}$ power law. In these regions, one also finds
pth-order structure functions that similarly obey the predicted $r^{p/3}$ power law. However,
the nature and distribution of the regions of the flow wherein the dissipation
is not constant remain unexplored, save that power law scalings are inapplicable in
many cases. For all that it can accomplish, my treatment is relatively primitive. The
underlying physical model is fairly simple and would benefit from further analysis such
as multifractal distributions of scaling exponents. Also, more sophisticated signal
processing would improve the quality of the data we are trying to analyze. For instance,
the cold-wire probe response smooths the data to a resolution of 50 centimeters, though
the data acquisition rate is capable of resolving scales down to 2 centimeters. It is a
distinct possibility that the high frequency portion of the data can be recovered by
deconvolving the raw data with an appropriate filter. This summer, I have been able
to support Air Force calculations of the upper atmosphere by studying fundamental
issues in turbulent mixing and signal processing.

1 Introduction


With AFOSR support, I have entered into a productive collaboration with Phillips Laboratory
Hanscom and Kirtland scientists to analyze aerothermal data in connection with the
Airborne Laser (ABL) project and, at the same time, to analyze the fundamental properties
underlying the turbulent mixing of passive scalars. The underlying objective of this work is
to make decisions on whether or not a laser beam can be propagated through the upper
atmosphere over distances of hundreds of kilometers in certain theatres of operation. Aerothermal
measurements are relatively inexpensive compared to airborne optical experiments. Knowing
the statistics of the thermal fluctuations over the path of the laser,

$$E(k) \propto C_T^2\, k^{-\alpha}, \qquad (1)$$

where $E(k)$ is the power spectral density of the thermal fluctuations, $C_T^2$ is a reflection of the
strength of these variations and $\alpha$ is predicted by Kolmogorov, Oboukhov and Corrsin to be
5/3, it is possible to simulate the beam propagation and make a decision as to whether the
beam will scintillate or not. Of course, without proper treatment, the aerothermal data are
useless and tell us nothing. As a personal note, it is gratifying to see that the fundamental
problem of passive scalar mixing remains at the heart of a critical application such as the
ABL project.

2  Some  pertinent  aspects  of  atmospheric  turbulence 
and  passive  scalar  mixing 

The central issues in this paper are Kolmogorov's universal equilibrium, turbulent mixing,
intermittent bursts, integral scales over which averages are taken, and non-stationarity.
Kolmogorov's seminal works, often referred to as K41 and K61 [15, 14, 16], are the most
substantial and predictive description of turbulence from first principles through the
Karman-Howarth-Monin relation. Overviews of Kolmogorov's assumptions and their implications can
be found in [7, 21, 22, 29] and many others. While these ideas only apply to kinetic energy
dynamics, velocity increments, power spectral densities and related quantities, Corrsin and
Oboukhov independently extended universal equilibrium concepts to thermal fluctuations
[6, 23], and this was later confirmed experimentally by Gibson and Schwarz [9]. One must
also take into account that while the equations governing the motions of slight thermal
variations and passive scalars are linear, they are coupled to the nonlinear Navier-Stokes equations,
and even simple solutions can yield chaotic trajectories and effectively mix scalar quantities
(see [2] for an overview). In this article, we will be analyzing thermal data in the absence of
any information about the kinetic energy of the flow. Furthermore, experimental evidence
shows that universal equilibrium, and in particular constant mean dissipation, is violated by
intermittent bursts, the statistics of which are an active area of research [5, 20, 24, 26, 27].
Central to the understanding of the statistics underlying any fluid quantity, $f$, is its average,
$\langle f \rangle$. Since atmospheric flows contain many large-scale structures that vary with time, a
considerable amount of attention is dedicated to determining meaningful ways to calculate
$\langle f \rangle$. In this paper, we study each of these subjects using wavelet analysis.

My analysis focuses on horizontal measurements of thermal fluctuations in the troposphere,
where the Reynolds numbers are very large. Universal equilibrium has been very
successful in describing the homogeneous turbulence in a wide range of laboratory and field
experiments. However, in atmospheric flows, there is very strong evidence that there are
many distinct regions within the atmosphere, each having distinct turbulent statistics. Many
of these flat pancake-like regions or "blinis" co-exist at the same altitude, defying the simpler
assumption that the atmosphere can be treated as a sphere covered by many concentric layers
like an onion, each with its own uniform turbulent statistics. While universal equilibrium
exists within the distinct pancake-like regions, the kinetic or thermal dissipations may vary
from one region to the next.

Within these atmospheric layers at small scales, the incompressible Navier-Stokes
equations apply:

$$\partial_t \mathbf{u} + (\mathbf{u} \cdot \nabla)\mathbf{u} = -\nabla p + \frac{1}{Re}\nabla^2 \mathbf{u}, \qquad (2)$$

where $\mathbf{u}(\mathbf{x}, t)$ is the velocity field, $p(\mathbf{x}, t)$ is the pressure, and $Re$ is the Reynolds number, a
dimensionless ratio of inertial to viscous forces. Furthermore, we assume that the velocity
field is incompressible,

$$\nabla \cdot \mathbf{u} = 0, \qquad (3)$$

and that fluctuations in the temperature, $\theta(\mathbf{x}, t)$, are very slight and can be treated as a
passive quantity. That is, $\theta$ moves and diffuses with the flow,

$$\partial_t \theta + (\mathbf{u} \cdot \nabla)\theta = \frac{1}{Pe}\nabla^2 \theta, \qquad (4)$$

but $\theta$ does not affect the fluid velocity as described by Equations (2) and (3). Here, $Pe$ is
the Peclet number, a dimensionless ratio of inertial to diffusive forces in the scalar field. The
mean temperature plays no role in this analysis because equation (4) is linear and involves
only derivatives of $\theta$.

Kolmogorov's universal equilibrium assumptions operate in the limit $Re \to \infty$, at small
scales, and far from boundaries in isotropic homogeneous flows. A more detailed discussion
can be found in [7, 11, 29]. The only aspect of universal equilibrium that we shall focus on
here is the hypothesis that the energy dissipation per unit mass, given by

$$\epsilon = \frac{1}{2Re}\sum_{i,j}\left(\frac{\partial u_i}{\partial x_j} + \frac{\partial u_j}{\partial x_i}\right)^2, \qquad (5)$$

remains constant. Provided all Kolmogorov's assumptions are met, one can expect a number
of consequences. First, for very large Reynolds numbers, there exists a range of scales,
called the inertial range, where the turbulent flow statistics depend solely on scale (or
wavelength) and the (constant) mean dissipation. If the flow is stationary and isotropic, then the
correlation function depends only on the distance:


$$\langle \mathbf{u}(\mathbf{x} + \mathbf{r}) \cdot \mathbf{u}(\mathbf{x}) \rangle = B(r), \quad r = |\mathbf{r}|, \qquad (6)$$

where $\langle \cdot \rangle$ denotes the ensemble average over all possible solutions. (In truth, our analysis
requires only local stationarity, as will be seen later. In §7, we will segment the data into
distinct regions wherein equation (6) is valid.) To describe the statistics of the flow in the
inertial range, define the power spectral density,

$$E(k) = \hat{B}(k), \qquad (7)$$

(where $\hat{B}$ is the Fourier transform of $B$), and the pth-order structure function,

$$S_p(r) = \langle |\delta \mathbf{u}(r)|^p \rangle, \quad p > 0, \qquad (8)$$

where $\delta \mathbf{u}$ is a velocity increment

$$\delta \mathbf{u}(r) = \mathbf{u}(\mathbf{x} + \mathbf{r}) - \mathbf{u}(\mathbf{x}). \qquad (9)$$

Then, a second consequence of Kolmogorov's assumptions is that the power spectral density
and the structure function obey the following power laws within the inertial range:

$$E(k) = C k^{-5/3}, \qquad (10)$$

$$S_p(r) = C_p r^{p/3}. \qquad (11)$$

As alluded to earlier, the power law behavior in kinetic energy and velocity increments
can be extended to passive scalar quantities described by equation (4). In this case, one
must assume that the Peclet number is very large and that the thermal dissipation, $\chi$,
remains constant as $Pe \to \infty$. As first deduced by Batchelor and elaborated by others,
several regimes can exist where $Re$ or $Pe$ can play some role in the dynamics [3, 4, 10, 17].
However, the scales at which we measure thermal variations in the troposphere are well
within the inertial-convective range where the flow properties depend only on $\chi$, $\epsilon$ and the
wavelength (or scale) of the quantities being described. In this case, the power spectral
density will be proportional to $k^{-5/3}$ and the temperature structure functions,

$$T_p(r) = \langle [\delta\theta(r)]^p \rangle, \qquad (13)$$

are predicted to have similar p/3 exponents,

$$T_p(r) = C |r|^{p/3}. \qquad (14)$$

Of course, the theorems of Kolmogorov and others are based on the existence of a universal
equilibrium and an inertial-dissipative range, and so their applicability to real measurements
of thermal fluctuations is severely limited by our ability to find regions of fluid flows where
these conditions exist.
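
As an illustration of how the p/3 prediction might be checked against a measured temperature record, the following sketch estimates structure functions from a uniformly sampled signal and fits the scaling exponent on log-log axes. It is not the code used in this study: the function names are hypothetical, lags are given in samples and must be at least 1, spatial averages stand in for ensemble averages, and absolute increments are used so that non-integer p is well defined.

```python
import numpy as np

def structure_function(theta, lags, p=2.0):
    """Spatial-average estimate of <|theta(x + r) - theta(x)|^p> per lag."""
    theta = np.asarray(theta, dtype=float)
    values = []
    for lag in lags:                      # each lag is a positive sample count
        increments = theta[lag:] - theta[:-lag]
        values.append(np.mean(np.abs(increments) ** p))
    return np.array(values)

def scaling_exponent(lags, values):
    """Slope of log(values) against log(lags), i.e. the power-law exponent."""
    slope, _ = np.polyfit(np.log(lags), np.log(values), 1)
    return slope
```

For a record lying in a region of constant dissipation, the fitted exponent for p = 2 would be compared with the predicted value 2/3.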

Sadly, the fact is that neither kinetic energy dissipation nor thermal dissipation has been
known to remain constant in the inertial or inertial-dissipative range. Indeed, intermittent
bursts, characterized by regions of relatively intense dissipation, play a central role in
real turbulent flows. The statistical properties of these bursts are the subject of intensive
phenomenological efforts, the most successful of which involve log-normal and log-Poisson
statistics for the turbulent dissipation [8, 26]. As a further complication, the scaling
exponents for passive scalar quantities have been observed to be quite different from those for
velocity structure functions and theoretically have very different properties [1, 18, 19, 28].
Nonetheless, turbulent flows typically contain large regions over which the mean dissipation
is roughly constant. Rather than attempt to model the bursts, we shall use wavelets to
identify the localized bursts and segment the data into regions where Kolmogorov's universal
equilibrium hypotheses are likely to apply.

Underlying  these  results  and  applications  is  the  notion  of  a  mean  quantity  or  ensemble 
average.  Since  ensemble  averages  are  not  generally  available,  we  rely  upon  spatial  or  temporal 
averages  over  some  fixed  interval.  Wavelets  are  especially  useful  when  analyzing  average 
quantities  because  the  wavelet  scaling  function  naturally  computes  the  mean  of  a  function. 
Of  course,  wavelets  do  not  automatically  select  the  appropriate  scale  for  which  a  spatial 
average  will  be  sufficiently  close  to  the  true  mean.  This  scale,  called  an  integral  scale,  is 
related  to  velocity  and  thermal  autocorrelations. 


3 A track shoe summary of fast Haar wavelet transforms


This  section  is  not  intended  to  provide  a  tutorial  in  wavelet  analysis.  (For  this,  I  would 
recommend  Gerry  Kaiser’s  most  amicable  book  [12].)  Rather,  I  will  describe  in  broad  strokes 
Haar  wavelets  and  why  they  can  help  us  understand  turbulent  data. 

The basic idea in wavelet analysis is to use a localized function, $\psi(x)$, to create a family
of functions or wavelets

$$\psi_{s,t}(x) = \psi\left(\frac{x - t}{s}\right), \qquad (15)$$

where $s > 0$. The wavelet has scale, $s$, and position, $t$. Convolving with a wavelet is similar
to a windowed Fourier transform except that in this case the window is built into and scales
with the wavelet. Using this wavelet, one can define a continuous wavelet transform (CWT)

$$\tilde{f}(s, t) = \int_{-\infty}^{\infty} \psi_{s,t}(x) f(x)\, dx. \qquad (16)$$

Like the Fourier transform, there is an inversion formula, but we shall leave CWTs at this
point and move to discrete Haar transforms.

If $\psi$ has reasonable properties, the continuous ($\mathbb{R}^+ \times \mathbb{R}$) basis can be discretized
($\mathbb{Z} \times \mathbb{Z}$), permitting multiresolution analysis of the signal. Multiresolution analysis consists of a
scaling function $\phi(x)$ and associated wavelet $\psi(x)$. This scaling function is a localized bump.
The wavelet can be deduced from the scaling function. Together, the scaling function and
the wavelets of infinitely decreasing scale can form a family of nested, orthogonal bases.

The scaling function for Haar wavelets is simply the indicator function:

$$\phi(x) = \begin{cases} 1 & \text{if } 0 \le x < 1 \\ 0 & \text{otherwise.} \end{cases} \qquad (17)$$

Notice that a convolution with $\phi$ is a moving average. Assuming that a turbulent flow is
ergodic, the Haar wavelet average directly corresponds to an ensemble average if the scale is
chosen properly.

Detail information is obtained from the Haar wavelet, which is a linear combination of
translates of the compressed Haar scaling function:

$$\psi(x) = \phi(2x) - \phi(2x - 1). \qquad (18)$$


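Together, the Haar scaling function and Haar wavelets form a discrete orthonormal basis

$$\psi_{mn}(x) = 2^{m/2}\, \psi(2^m x - n) \qquad (19)$$

(with $\phi_{mn}$ defined analogously), with which one can decompose a signal into local scale components,

$$a_m(n) = \int_{-\infty}^{\infty} \phi_{mn}(x) f(x)\, dx, \qquad (20)$$

$$d_m(n) = \int_{-\infty}^{\infty} \psi_{mn}(x) f(x)\, dx. \qquad (21)$$

Then

$$f(x) = \sum_{n=-\infty}^{\infty} a_0(n) \phi_{0n}(x) + \sum_{m=0}^{\infty} \sum_{n=-\infty}^{\infty} d_m(n) \psi_{mn}(x). \qquad (22)$$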

Furthermore, there is a direct correspondence between Fourier spectra and wavelet scale
spectra. Since the wavelets are mutually orthogonal,

$$\int_{-\infty}^{\infty} [f(x)]^2\, dx = \int_{-\infty}^{\infty} \left[ \sum_{n=-\infty}^{\infty} a_0(n) \phi_{0n}(x) + \sum_{m=0}^{\infty} \sum_{n=-\infty}^{\infty} d_m(n) \psi_{mn}(x) \right]^2 dx \qquad (23)$$

$$= \sum_{n=-\infty}^{\infty} a_0(n)^2 + \sum_{m=0}^{\infty} \sum_{n=-\infty}^{\infty} d_m(n)^2. \qquad (24)$$

The energy at scale $m \ge 0$ (wavelength $2^{-m}$) is proportional to $\langle d_m(n)^2 \rangle$, where the ensemble
average is taken over the $n$ index [13]. Thus, when I calculate spectra, I typically use wavelet
scale spectra. Further details are contained in a paper I am preparing with Gerry Kaiser and
Don Washburn [25].
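
The discrete counterpart of this decomposition can be sketched in a few lines. The following is a minimal illustration of a fast (pyramid) Haar transform and the resulting scale spectrum for a sampled signal whose length is a power of two; it is not the code used for the results reported here, and the function names and the ordering of scales (finest first) are choices made for the example.

```python
import numpy as np

def haar_transform(signal):
    """Fast Haar transform of a signal whose length is a power of two.

    Returns (approx, details): the single coarsest average coefficient and a
    list of detail-coefficient arrays ordered from finest to coarsest scale.
    """
    a = np.asarray(signal, dtype=float)
    n = a.size
    if n < 2 or n & (n - 1):
        raise ValueError("signal length must be a power of two (>= 2)")
    details = []
    while a.size > 1:
        even, odd = a[0::2], a[1::2]
        details.append((even - odd) / np.sqrt(2.0))  # wavelet (detail) part
        a = (even + odd) / np.sqrt(2.0)              # scaling (average) part
    return a[0], details

def scale_spectrum(details):
    """Mean squared detail coefficient at each scale, finest scale first."""
    return np.array([np.mean(d ** 2) for d in details])
```

Because each step is orthonormal, the sum of the squared coefficients equals the sum of the squared samples, which is the discrete analogue of equations (23) and (24).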

4  A  simple  segmentation  procedure 

This segmentation of the thermal data is based upon finding regions where Kolmogorov's
hypotheses are valid because the thermal dissipation is constant in the sense that averages of
the dissipation over integral scales remain constant. This approach is based on the following
principal assumptions.

Assumption  1  The  temperature  field  being  measured  varies  along  streamlines. 

Trivially, if there are no temperature variations, no mixing can occur. Furthermore, if the
temperature field is constant along streamlines for all time (though it is difficult to imagine
this outside of a laminar regime), mixing is restricted to diffusive transport and advection
will play no role. However, if there are thermal fluctuations along streamlines, advective and
diffusive mixing can occur. For a turbulent flow, this assumption is hardly restrictive.

Assumption 2 A sudden increase in kinetic energy dissipation, $\langle \epsilon \rangle$, will correspond to a
similar increase in thermal dissipation, $\langle \chi \rangle$.

This assumption is only stated in one direction because chaotic mixing is possible without an
underlying turbulent flow. However, localized bursts of kinetic energy dissipation directly
correspond to intense velocity gradients. These gradients generate the strong shears required
for advective mixing in any laminar or turbulent flow. Of course, determining these quantities
requires an ensemble or spatial average.


5  Aside:  integral  scales 

To come close to calculating a true ensemble average, $\langle \theta \rangle$, we must resort to a temporal
average, or a spatial average relying upon Taylor's hypothesis. Without delving further into
issues of ergodicity, we shall assume that we can use spatial averages, and focus instead upon
the integral scale, the scale over which averages converge to the true mean. Knowing
the correct scale can determine which wavelet scaling functions will be meaningful when
measuring spectra or structure functions.

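If we define

$$\langle \theta \rangle_L = \frac{1}{L} \int_{x_0}^{x_0 + L} \theta(s)\, ds, \qquad (25)$$

where it is understood that the average $\langle \theta \rangle_L$ can be displaced an arbitrary distance $x_0$, the
quantity to calculate is the error

$$e(L) = \left\langle \left( \langle \theta \rangle_L - \langle \theta \rangle \right)^2 \right\rangle. \qquad (26)$$

In this context, $\langle \theta \rangle$ can be defined to be $\lim_{L \to \infty} \langle \theta \rangle_L$. Without loss of generality, we shall
treat $x_0 = 0$ from this point onward. If we designate the zero-mean portion of the function
to be

$$\vartheta = \theta - \langle \theta \rangle, \qquad (27)$$

then

$$e(L) = \left\langle \langle \vartheta \rangle_L^2 \right\rangle \qquad (28)$$

$$= \left\langle \left( \frac{1}{L} \int_0^L \vartheta(s)\, ds \right)^2 \right\rangle \qquad (29)$$

$$= \frac{1}{L^2} \int_0^L \int_0^L \langle \vartheta(s_1) \vartheta(s_2) \rangle\, ds_1\, ds_2. \qquad (30)$$

If the function $\vartheta$ is stationary over $(s_1, s_2) \in [0, L] \times [0, L]$, then

$$\langle \vartheta(s_1) \vartheta(s_2) \rangle = \rho(s_1 - s_2) \langle \vartheta^2 \rangle, \qquad (31)$$

where $\rho(\Delta)$ is the autocorrelation coefficient and $\Delta$ is the separation between the points
being correlated. Substituting the autocorrelation coefficient into the previous expression,
changing variables to $\Delta = s_1 - s_2$ and $D = (s_1 + s_2)/2$, and using the fact that $\rho(\Delta)$ is
even, one obtains

$$e(L) = \frac{\langle \vartheta^2 \rangle}{L^2} \int_0^L \int_0^L \rho(s_1 - s_2)\, ds_1\, ds_2 \qquad (32)$$

$$= \frac{\langle \vartheta^2 \rangle}{L^2} \int_{-L}^{L} \int_{|\Delta|/2}^{L - |\Delta|/2} \rho(\Delta)\, dD\, d\Delta \qquad (33)$$

$$= \frac{2 \langle \vartheta^2 \rangle}{L} \int_0^L \left( 1 - \frac{|\Delta|}{L} \right) \rho(\Delta)\, d\Delta. \qquad (34)$$

For large $L$, the contribution by the $|\Delta| \rho(\Delta)/L$ term is small because $\rho(\Delta)$ decays to zero
independent of $L$. What remains is

$$e(L) \approx \frac{2 \langle \vartheta^2 \rangle}{L} \int_0^L \rho(\Delta)\, d\Delta \approx \frac{2 \langle \vartheta^2 \rangle}{L} \int_0^\infty \rho(\Delta)\, d\Delta. \qquad (35)$$

The integral $\int_0^\infty \rho(\Delta)\, d\Delta$ has units of length (or time for temporal averages) and is commonly
referred to as the integral scale, $\mathcal{L}$. Hence, the error in one's estimate of $\langle \theta \rangle$ is approximately

$$e(L) \approx 2 \langle \vartheta^2 \rangle \frac{\mathcal{L}}{L}, \qquad (36)$$

and one can expect $\langle \theta \rangle_L$ to be a good approximation if one averages over several integral
scales.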
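
For a sampled record, the integral scale and the resulting averaging error might be estimated as in the sketch below. This is an illustration under stated assumptions rather than the procedure used in this work: the function names are hypothetical, the autocorrelation is computed directly for a limited number of lags, and the integral is truncated at the first non-positive value of the sample autocorrelation, which is a common practical choice that the text does not prescribe.

```python
import numpy as np

def autocorrelation(theta, max_lag):
    """Sample autocorrelation coefficient for lags 0 .. max_lag - 1 (in samples)."""
    fluct = np.asarray(theta, dtype=float)
    fluct = fluct - fluct.mean()              # zero-mean part of the signal
    variance = np.mean(fluct ** 2)
    n = fluct.size
    rho = np.empty(max_lag)
    for k in range(max_lag):
        rho[k] = np.mean(fluct[: n - k] * fluct[k:]) / variance
    return rho

def integral_scale(theta, dx, max_lag):
    """Trapezoid-rule integral of rho up to its first non-positive value."""
    rho = autocorrelation(theta, max_lag)
    nonpositive = np.nonzero(rho <= 0.0)[0]
    stop = nonpositive[0] if nonpositive.size else max_lag
    r = rho[:stop]
    return dx * (r.sum() - 0.5 * (r[0] + r[-1]))

def averaging_error(theta, dx, length, max_lag):
    """Approximate mean-square error of an average over `length`, as in Eq. (36)."""
    variance = np.var(np.asarray(theta, dtype=float))
    return 2.0 * variance * integral_scale(theta, dx, max_lag) / length
```

The estimate is only meaningful within a single stationary segment of the record.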

Of  course,  a  given  integral  scale  is  only  valid  in  one  stationary  domain.  Furthermore,  the 
integral  scale  varies  with  the  function  one  is  measuring.  For  instance,  rather  than  measure 
the mean temperature, $\langle\theta\rangle_L$, one might be interested in approximating structure functions,

$$T_p(r) \approx \left\langle |\theta(x + r) - \theta(x)|^p \right\rangle_L. \tag{37}$$


Also,  even  if  one  can  calculate  the  integral  scale  in  terms  of  correlations,  it  is  not  clear  how 
many  integral  scales  one  must  use  for  one’s  calculations.  However,  we  shall  see  in  the  next 
subsection  that  these  scales  can  also  be  determined  empirically  directly  from  the  data. 
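
To make the structure-function estimate concrete, the sketch below averages the p-th power of
temperature increments over a record. It assumes the conventional definition, the mean of
|θ(x + r) − θ(x)|^p with the separation r expressed as a whole number of samples; the function
name and arguments are hypothetical.

```python
import numpy as np

def structure_function(theta, lag, order):
    """Average of |theta(x + r) - theta(x)|**order over the record, r = lag samples."""
    theta = np.asarray(theta, dtype=float)
    if not 0 < lag < theta.size:
        raise ValueError("lag must be between 1 and len(theta) - 1")
    increments = theta[lag:] - theta[:-lag]   # all pairs separated by `lag` samples
    return np.mean(np.abs(increments) ** order)
```

Repeating the call over records of increasing length is one way to see how long an interval
is needed before a high-order estimate (for instance, order 12) stops changing.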


6  Treatment  and  limitations  of  the  data 

The high-resolution data from the airborne laboratory is filtered electronically into low-pass
and high-pass components as it is collected. Since we are only interested in temperature
fluctuations away from mean quantities, we only study the high-pass component. Various
data sets are gathered at different altitudes, airspeeds and flight paths. In order to compare
different data sets, all time variables are converted into spatial variables using the jet's true
airspeed. Thus, from one data set to another, the sampling intervals may vary, though
this variation is slight. As a rule of thumb, the instruments are capable of measuring the
temperature every 2 centimeters. However, the cold-wire is much slower than this, and one
would only expect to resolve features larger than 20 cm without more sophisticated treatment
of the data. Also, when atmospheric turbulence intensities are low, the turbulence generated
by the aircraft contaminates the data at wavelengths corresponding to about 6 meters.
Since the high-pass filter rolls off at about 60 meters, we consider our data to be valid for
roughly one decade between 6 and 60 meters in low turbulence. If the turbulence is strong,
we can expect to resolve inertial ranges from 60 meters down to about 20 cm where probe
effects become important.

Figure 1: Empirical determination of integral scale by calculating the spectra over varying
intervals. By varying the interval over which spectra are calculated, it is easy to see
what lengths are required for good frequency resolution. These calculations are made over
a long segment of near constant dissipation.

Since the aircraft only has one probe and can only fly in one direction at a time,
we cannot calculate all three components of the thermal dissipation, but we can calculate
$\langle (d\theta/dx)^2 \rangle$ using centered differences. If the flow is locally isotropic within the resolution
of our measurements (less than one meter!) and we average over a small distance, we can rest
assured that this quantity is proportional to the thermal dissipation. George Papanicolaou and his
group have calculated $\mathcal{L}$ for the 96223 data and found the integral scale to be roughly 2
to 4 km, in agreement with my empirical calculations based on spectral convergence and
convergence of the structure functions. In Fig. 1, one sees that the spectrum converges over
scales of between 2 and 5 km. Similarly, in Fig. 2, the 12th order structure function converges
with similar averaging intervals.
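
The conversion from time to space and the centered-difference gradient can be sketched as
follows. This is an illustration only: the sample rate and airspeed arguments are
hypothetical inputs, and the frozen-turbulence conversion dx = airspeed / sample rate is
assumed to be what "using the jet's true airspeed" amounts to.

```python
import numpy as np

def gradient_squared(theta, sample_rate_hz, true_airspeed_m_s):
    """Return (x, (dtheta/dx)**2) at interior points using centered differences."""
    theta = np.asarray(theta, dtype=float)
    dx = true_airspeed_m_s / sample_rate_hz            # metres between samples
    x = dx * np.arange(theta.size)                     # along-track position
    dtheta_dx = (theta[2:] - theta[:-2]) / (2.0 * dx)  # centered difference
    return x[1:-1], dtheta_dx ** 2
```

Averaging the returned squared gradient over a short window gives a quantity that, under the
local-isotropy assumption stated above, is proportional to the thermal dissipation.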

7  Results  of  segmentation 

By examining the wavelet averages of the thermal dissipation data, we find substantial
advantages to segmenting the data in regions of uniform dissipation. If one measures spectra
and structure functions without regard to dissipation, one quickly finds that the spectra and
structure functions do not obey K41 power laws. However, calculating the same functions
along regions that have constant dissipation, one recovers Kolmogorov scalings. Furthermore,
one can calculate spectra over many disconnected subsets of the signal in instances where
the size of a single region of constant dissipation is small relative to the integral scale.

Figure 2: Empirical determination of integral scale by calculating the structure functions
over varying intervals. Once again, the integral scale for turbulent mixing can be calculated
empirically by varying the interval over which structure functions are averaged. These
calculations are made over a long segment of near constant dissipation.
[Plot panels: 12th order structure function]

A second and unforeseen advantage to wavelet analysis of the thermal dissipation is the
discovery of regular patterns buried within the thermal measurements. It should be added
at this point that we attempted wavelet analysis of the thermal data as a supplement to an
exhaustive Fourier analysis of this data at Phillips Laboratory. Whether the source of
these patterns is natural or artificial is still under active investigation, but the fact remains
that their existence was invisible to standard Fourier analysis.


8  Segmentation  based  upon  dissipation 

As described earlier, a signal (dissipation) can be represented as the sum of a Haar wavelet
average plus an infinite sum of Haar wavelet differences. One can choose the scale of the
averages arbitrarily, and this determines the coarsest scale of the wavelet differences. In the
case of thermal dissipation in a turbulent flow, the Haar averages are physically relevant
because

$$A_m = \langle\chi\rangle_{2^m}. \tag{38}$$

Thus, we examine wavelet averages primarily even though the orthogonal decomposition
of the signal involves the wavelet differences. Often supporting information can also be
gathered from the wavelet differences.
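
A minimal sketch of the Haar averages and differences follows. It uses an unnormalized
convention, assumed here so that each average is literally the mean of the signal over a block
of 2^m samples; the function name and the handling of an unpaired last sample are
illustrative choices rather than the implementation used for the figures.

```python
import numpy as np

def haar_levels(signal, levels):
    """Return Haar averages and differences for block sizes 2, 4, ..., 2**levels."""
    current = np.asarray(signal, dtype=float)
    averages, differences = [], []
    for _ in range(levels):
        if current.size < 2:
            break
        current = current[: current.size // 2 * 2]   # drop an unpaired last sample
        pairs = current.reshape(-1, 2)
        # Difference of the two finer-scale averages that make up each block
        differences.append((pairs[:, 0] - pairs[:, 1]) / 2.0)
        current = pairs.mean(axis=1)                 # block mean at the coarser scale
        averages.append(current)
    return averages, differences
```

Each finer-scale pair is recovered as the coarse average plus or minus the corresponding
difference, which is the sense in which the differences complete the decomposition.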

Upon viewing wavelet averages and differences of the thermal dissipation in the troposphere,
one is immediately impressed with the fact that the dissipation is not constant even
at coarse resolutions (see Fig. 3). However, within the 80 kilometer segment, there are some
regions wherein the dissipation does appear to be roughly constant. For instance, the first
45 kilometers of the thermal dissipation shown in Fig. 3 exhibit a thermal dissipation somewhere
between $4 \times 10^{-3}$ and $5.1 \times 10^{-3}$. Afterward, there are localized regions where the
dissipation surges several times higher depending upon the scale over which one is averaging.

Figure 3: Dilated Haar wavelet averages and differences of $\langle (d\theta/dx)^2 \rangle \propto \chi$ for one set of thermal
measurements in the troposphere. Haar wavelet averages can be thought of as running
averages of the signal, and the differences can be expressed as differences of averages at the
previous (finer) scale.

Figure 4: Spectrum (left) and 12th-order structure function (right) at different positions
using the signal displayed in Fig. 3. In Segment A (displaced 30,000 m), the dissipation
is roughly constant while in Segment B (displaced 50,000 m), the dissipation is changing
abruptly. A best fit to the data over the range of values not affected by the low-pass roll-off
and the aircraft-induced turbulence shows that a Kolmogorov spectrum is realized in Segment
A while it is not in Segment B. The high order structure function on the right shows an
even sharper difference between the two segments. Larger windows are required for the high
order structure functions because they have longer integral scales.


Another interesting feature of thermal dissipation in the troposphere is that the brief
regions of elevated thermal dissipation have substantial structure at small scales while the
regions of constant dissipation have relatively little structure aside from expected small scale
variations as $2^m \Delta t$ drops below the integral scale associated with averaging thermal dissipation.
(Though 2 kilometers would hardly qualify as “brief” in many atmospheric contexts, we use
the term here relative to the vast regions of constant dissipation.) We can segment the data
using the wavelet averages of dissipation to tell us where Kolmogorov's universal equilibrium
hypotheses are valid. Not surprisingly, if one calculates the power spectral density in a region
of constant dissipation, one recovers a Kolmogorov spectrum. Moreover, one can also recover
very high order structure functions in these regions (see Fig. 4). However, segmenting the
data near the localized bursts can yield very non-Kolmogorov behavior.
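
One way to check a segment against the Kolmogorov $k^{-5/3}$ form is a straight-line fit in
log-log coordinates over the band of wavenumbers considered valid. The sketch below is an
assumption-laden illustration: the function name and the band limits passed in are
hypothetical, and an ordinary least-squares fit stands in for whatever best-fit procedure
produced the figures.

```python
import numpy as np

def fit_power_law(k, spectrum, k_min, k_max):
    """Fit log E = log C + slope * log k over k_min <= k <= k_max; return (slope, C)."""
    k = np.asarray(k, dtype=float)
    spectrum = np.asarray(spectrum, dtype=float)
    band = (k >= k_min) & (k <= k_max) & (spectrum > 0)   # usable points only
    slope, intercept = np.polyfit(np.log(k[band]), np.log(spectrum[band]), 1)
    return slope, np.exp(intercept)
```

A fitted slope near -5/3 within the band would be consistent with Kolmogorov scaling, while a
slope that drifts as the band is moved suggests that no single power law describes the segment.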

This behavior is explained, not in terms of intermittent bursts, but in terms of stationarity.
In this case, the elevated regions of thermal dissipation can occupy several kilometers of
space. While they are localized, these regions have a substantial scale. This data is entirely
consistent with radar observations of large flat regions in the atmosphere co-existing at the
same level. In other words, one explanation for the data is that the aircraft passed through
a 2 kilometer region having a mean thermal dissipation different from the surrounding
atmosphere. If this were the case, we should be able to recover Kolmogorov statistics when
we calculate spectra entirely within the burst. Indeed, as seen in Fig. 5, this is precisely
the case. Anomalous behavior in spectra can be explained as a lack of frequency resolution
for the dominant part of the signal. For this window, one would expect the spectrum to be
the sum of two distinct $Ck^{-5/3}$ spectra. The high dissipation portion would dominate the
sum because $C$ is directly related to $\chi$. However, one can also see in Fig. 3 that the high
dissipation portions of the signal have relatively small measure, meaning that there is very
low frequency resolution. Therefore, one would expect the spectrum and structure functions
to have a fair amount of noise in them.

Figure 5: Spectra calculated near a burst. On the left, one can see three segments near
a sharp increase in dissipation. The first is in a region of relatively calm dissipation. The
last is entirely within the region of high dissipation. The middle segment straddles the two.
On the right, we calculate the spectra over these intervals. Notice that the spectrum does
not obey a power law in the transition region. Though a best fit is provided, one can see
considerable variability in the slope over the region of interest.
[Panel titles: “The anatomy of a burst” (wavelet average (a10) of dissipation); “Spectra near a burst”]


9  Observation  of  embedded  waves 

In  addition  to  providing  information  about  thermal  dissipation,  the  wavelet  analysis  of  the 
thermal  dissipation  uncovered  an  unanticipated  feature  in  the  data  that  may  be  physical  or 
may  be  instrumental.  There  are  distinct  asymmetric  square  waves  in  the  dissipation. 

Re-examining Fig. 3, one will notice a regular pattern in the calm region extending
from 13 km up to 30 km, and it would appear that there is a square wave buried in the
turbulence signal. This apparent periodicity in $\langle (d\theta/dx)^2 \rangle$ means that the roughness of $\theta(x)$ has
a regular spatial structure though it says nothing about the magnitudes of the temperature
fluctuations themselves. These fluctuations are very minor, accounting for less than 1% of
the total energy in the data, yet they persist in every broad region of constant dissipation
through all the data collected on this particular day (see Fig. 6). There are two reasons why
features like this do not emerge with the traditional approaches currently used at Phillips
Laboratories. First, they have not been studying dissipation because they are basing their
analysis solely on phenomenological reasoning rather than the dynamics of the mixing. In
studying fundamental problems behind ABL, I decided to look at the dissipation and found
something interesting. The second reason why it would not show up with the standard
reductions used at Phillips is that they are looking for features in the Fourier domain. While
these waves have frequency content, the full signal also contains bursts and large deviations
from these waves that would obscure any spike that appears. Finally, this pattern occurs
at wavelengths much greater than those corresponding to the frequency cutoffs of the data
acquisition circuitry, so naturally, one would not look to find signals at these wavelengths.

Figure 6: Observed periodicity in thermal dissipation. The lower graph is the raw data, and
the upper graph is the wavelet average (running average over $2^{10}$ points) of the same signal.
A suspicious pattern emerges which we are still trying to explain. Aspects like this in the
signal are not detected with standard processing techniques. Note: The thickness in the
latter half of the wavelet series is an artifact of the plotting package and not the signal itself.

Though we lack a full explanation for the periodicity, we have studied it in some detail.
The waves are asymmetric in that the peaks are shorter than the troughs. The spectrum of $\theta$
does not vary in and around the peaks and troughs as one might expect. The wavelengths
vary with altitude but not direction of flight. (If the waves were physical and moved at a
speed different from the mean wind velocity, such as leeward waves near mountain ranges, we
would expect to observe a Doppler shift.) The variations in wavelength with altitude rule
out systematic spatial or temporal fluctuations in the aircraft or data acquisition
software and hardware as the cause. The fact remains that though we cannot explain them (yet), the
waves emerged naturally from routine wavelet analysis while this is certainly not the case
with Fourier analysis of the same data.


10  Future directions


There are two avenues of exploration on which I can provide immediate support to the
aerothermal program: treatment of the raw data and analytic techniques for analyzing the
data. Some of the problems with the data collection can be treated post-experimentally and
some require instrumental changes. My expertise can only address the first issue directly, but
I feel that I can help assess the impact of more costly instrumental changes. By understanding
when and where instrumental errors contaminate the data, I can help assess whether or not
we really need to correct them. For instance, the “noise floor” only appears in relatively low
intensity turbulence, so perhaps it is not important to correct a problem that only exists
outside of the regime of interest. Nonetheless, we need to confirm that the noise floor is not
concealing information that is of interest in the long term.

Presently, everyone involved with the project treats voltages from the cold-wires as being
linearly proportional to the temperature. Knowing from the ABLE ACE report that the
probes’ response is somewhere near 50 cm (400 Hz), it is clear that the data is substantially
oversampled. The “noise floor,” which most people involved in the project believe is
caused by aircraft turbulence, further corrupts the measurements up to 6 meters in areas
of weak turbulence. Finally, no one has investigated the role of dynamic heating in these
measurements.

The current program of calculating $C_T^2$ at Phillips Laboratories Kirtland is based on
uniform, systematic analysis of the aerothermal data. This summer, I have shown that
wavelet analysis of thermal dissipation can reveal some aspects of the underlying turbulent
flow which, in turn, provide a degree of confidence to the assumption that a power law
exists. The fact remains that power law behavior might not exist everywhere. The data is
nonstationary and often subject to strong intermittent bursts. Understanding the properties
and structures that are physically relevant in the atmosphere may lead to better techniques
for analyzing the raw data.


11  Conclusions 

This  summer  has  been  a  very  productive  time  for  me  thanks  to  the  Faculty  Summer  Research 
Program.  In  a  relatively  short  time,  I  have  refamiliarized  myself  with  some  of  the  important 
aspects  of  turbulent  mixing,  and  at  the  same  time,  learned  a  lot  about  wavelets  and  signal 
processing.  During  the  entire  summer,  I  worked  hard  at  keeping  this  work  focused  upon  the 
immediate  needs  of  the  ABL  program  by  pursuing  problems  that  my  focal  point  thought 
were  important.  As  a  result,  my  collaborators  and  I  have  developed  a  new  technique  for 
determining  stationary  regions  in  turbulent  flows  and  calculating  turbulent  statistics  that 
can  be  used  for  comparisons  and  validation.  Now,  I  would  like  to  continue  this  research  into 
the upcoming year, and I hope the AFOSR will continue to support this effort through its
Summer Research Extension Program.

Abstract

The  Reusable  Aero  Space  Vehicle  (RASV)  concept  is  considered  in  order  to  analyze  the 
effects  of  various  atmospheric  conditions  upon  a  mission  to  release  a  payload  near  to  a 
polar,  low  earth  orbit.  After  payload  release,  the  spaceplane  is  intended  to  circumnavigate 
the  globe  and  return  to  launch  site  by  skipping  off  the  atmosphere,  while  the  payload 
nominally  includes  a  small  upper  stage  that  boosts  a  satellite  into  a  final  orbit.  In  order  to 
evaluate  atmospheric  effects,  Monte  Carlo  atmospheric  temperature,  pressure,  density,  and 
winds  (based  on  time  of  the  year,  recent  solar  activity,  and  vehicle  geographic  position  and 
altitude)  are  applied  to  a  baseline  RASV  mission.  Results  indicate  that  these  around-the- 
world  RASV  trajectories  are  feasible  under  a  broad  range  of  atmospheric  conditions. 
However, for a given set of flight controls, it is found that maximum vehicle altitude,
aerodynamic heat rate, and skip amplitude vary substantially with atmospheric
perturbations.

ATMOSPHERIC  EFFECTS  UPON  SUB-ORBITAL
BOOST  GLIDE  SPACEPLANE  TRAJECTORIES 

David  P.  Stapleton 


Introduction 

It  is  desired  to  simulate  the  effects  of  varying  atmospheric  conditions  (including  winds)  upon  the 
performance  of  a  spaceplane  that  attempts  to  fly  an  around-the-world  mission  from  within  the 
continental  United  States,  en  route  releasing  a  payload  into  prescribed  near-orbit  conditions.  In 
order  to  reduce  the  influence  of  the  earth’s  rotation  and  to  more  accurately  assess  the  influence  of 
latitudinal  climate  differences,  trajectories  that  approximately  cross  the  earth’s  poles  are  considered. 

Flight  simulations  have  been  generated  by  combining  existing  software  to  model  trajectories  with 
software  to  simulate  random  atmospheres.  Specifically,  the  Program  to  Optimize  Simulated 
Trajectories  (POST  Version  5.1  -  see  Brauer,  1990),  developed  by  Martin  Marietta  Corporation, 
has  been  combined  with  the  NASA/MSFC  Global  Reference  Atmospheric  Model  -  1995  Version 
(GRAM-95,  discussed  by  Justus  et  al.,  1995).  Both  programs  have  been  slightly  modified  to 
reduce  execution  time  and  so  that  GRAM-95  can  be  called  as  a  subroutine  by  the  POST  program. 
The  specifications  of  a  sub-orbital  version  of  a  Reusable  Aero  Space  Vehicle  (RASV)  as 
envisioned  by  Boeing  Corp.  (see  also  Froning,  1996,  and  Johnson,  1992)  prescribe  the 
spaceplane  model  parameters  for  POST. 

The  Atmosphere  Model 

The  GRAM-95  computer  code  was  chosen  to  provide  atmospheric  data  because  it  is  a  well  tested 
program  that  can  provide  geographically  and  time  varying,  correlated,  random  values  of  necessary 
parameters.  It  computes  data  in  the  lower  (0-27  km),  middle  (20-120  km),  and  upper  (90+  km) 
altitudes via three different models and obtains results at transition altitudes by “fairing” estimates
of the individual models. Temperature, density, pressure, speed of sound, and wind components
are the quantities subsequently used in (POST) trajectory calculations.

The  middle  atmosphere  model  in  GRAM-95  is  based  on  data  from  the  Middle  Atmosphere 
Program (MAP - see Barnett and Corney, 1985a, and 1985b) and provides the most critical data
for  skipping  trajectories  to  POST  from  the  vehicle  altitude  and  latitude,  the  elapsed  time,  and  the 
month  of  the  year.  Diurnal  effects  are  not  modeled;  nor  are  solar  effects,  nor  gravity  waves. 

The  upper  and  lower  portions  of  the  atmosphere  have  a  smaller  influence  upon  spaceplane 
performance  since  little  time  is  spent  at  these  altitudes.  It  is  useful  to  understand,  however,  that  (i) 
in the lower atmosphere the Global Upper Air Climatic Atlas (GUACA Version 1.0 - see Ruth, et
al., 1993) provides historical data from which random parameters are obtained, and (ii) in the
upper atmosphere model, or Jacchia model (see Hickey, 1988a, and 1988b), influences of the day
of the month, diurnal variations, and solar effects (as well as altitude, latitude, time elapsed, and
month) are taken into account.

The  Baseline  Trajectory  Model  (Using  the  U.S.  Standard  Atmosphere,  1962) 

A  baseline  trajectory  was  generated  for  a  flight  of  the  RASV  from  Edwards  Air  Force  Base, 
California,  around  the  world  and  back,  by  two  POST  runs  using  a  polar  trajectory  headed  initially 
due  south,  through  a  geographically  uniform  U.S.  Standard  Atmosphere,  1962  (with  no  winds). 

In  the  first  run  of  the  baseline  case  POST  was  requested  to  maximize  the  weight  of  a  payload 
subject  to  prescribed  conditions  at  the  end  of  the  powered  phase  of  ascent: 

(i)  inertial flight path angle must be within 0.001° of 0.0°,

(ii)  inertial velocity is required to be within 10 ft/sec of 25,814 ft/sec, and

(iii)  altitude must lie within 100 ft of 340,495 ft (about 104 km).


These  conditions  occurred  just  prior  to  an  unpowered  elliptical  orbit  (Hohmann)  transfer  to  an 
apogee  altitude  at  which  the  payload  could  be  released.  The  vehicle  assumed  a  horizontal  lift-off 
position,  was  sled-launched  at  an  azimuth  of  180°  (due  south)  with  an  initial  relative  velocity  of 
600  ft/sec  from  a  runway  at  2,276  ft  altitude,  and  was  powered  by  two  modified  space  shuttle  main 
engines  (SSMEs)  for  522.31  seconds  thereafter.  The  engines  were  assumed  to  initially  deliver 
1.01 million lbs of thrust, have an instantaneous specific impulse (ISP) of 449.8 sec and an exit
area of 58 ft². When the RASV reached 50,000 ft, its extendable nozzles were moved to a second
position so that 1.041 million lbs of thrust was delivered with an ISP of 463.5 seconds and an exit
area of 174 ft². The propellant weight decreased from 1,090,700 lbs as shown in Fig. 1 and the
vehicle dry weight was 134,300 lbs (not including a payload of about 25,000 lbs) for a gross
takeoff weight of about 1.25 million lbs.
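
As a quick consistency check on the quoted gross takeoff weight, the stated propellant weight,
dry weight, and approximate payload can be summed directly (the payload is taken as the
rounded 25,000 lbs given in the text):

```latex
W_{\text{gross}} \approx 1{,}090{,}700 + 134{,}300 + 25{,}000
               = 1{,}250{,}000\ \text{lb} \approx 1.25 \times 10^{6}\ \text{lb}
```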

Figure 1. RASV Propellant Weight as a Function of Time

It was found that a payload of 25,100 lbs could be lifted to the near-orbit conditions if the attack
angles shown in Table 1 were employed (with bank angle zero).


Table 1. Attack and Bank Angles for RASV Ascent

Event    Time (sec)    Attack Angle (deg)
  1          0.            3.5685925
  2         15.            6.9171659
  3         35.            2.4401747
  4         75.            1.0975112
  5        175.            2.1646243
  6        295.            1.684
  7        345.            0.81079
  8        395.            0.7131
  9        445.           -0.23185

The attack angles and payload weight were inserted into a second POST computation in which the
RASV was required to return to Edwards AFB from the payload release point - but without
releasing its payload (it was assumed that a return with the payload would be more difficult than a
return after a release, due to additional weight). The aerodynamic heat rate was constrained to a
maximum of 200 BTU/ft²/sec¹. POST was requested to meet these objectives by choosing an
angle of attack and a bank angle during each of the phases: (i) from vehicle apogee until 70,000 lbs
of lift was generated, and (ii) from this point until the vehicle relative velocity dropped to 5,000
feet per second. From this point the RASV was (somewhat unrealistically) assumed to dive to
landing by use of a constant flight path angular rate. A skipping return to Edwards AFB was
found to be possible using attack angles of 6.704° and 6.771° with corresponding bank angles of
-12.82° and -12.91° for the two phases, respectively. The baseline ground track (subvehicle path)
appears in Fig. 4, and the associated skipping profile of altitude versus time is presented in Fig. 5.

¹ Based upon a one foot radius reference sphere.


Figure 4. Ground Track for the Baseline RASV Skipping Trajectory
[Plot title: optimize rasv ascent (polar leo) with heating constraint]

Figure 5. Skipping Profile for the Baseline RASV Trajectory
[Plot title: RASV Baseline Trajectory, 1962 Mean Atmosphere]

Monte Carlo Flight Simulations

With  the  optimized  or  baseline  trajectory  for  the  uniform  atmosphere  complete,  it  was  possible  to 
execute  Monte  Carlo  POST  runs  with  GRAM-95  generated,  globally  varying,  random  atmospheres 
by  implementing  the  attack  angles,  bank  angles,  and  payload  weight  of  the  baseline  trajectory. 
Twenty  such  runs  for  each  of  January,  April,  July,  and  October  were  performed.  Each  run  was 
made  on  a  200  MHz  Pentium-class  PC,  and  required  approximately  2  minutes  using  POST’s 
projected  gradient  optimization  mode  with  zero  control  vector  iterations. 

Since GRAM-95 requires that parameters for solar activity be input (only for effects above 90 km,
or 295,276 ft) in each run, random solar parameters were generated externally. The 3-hourly
geomagnetic index ap (used in the GRAM model to compute a geomagnetic correlation to the
exospheric temperature) was selected from the percent frequency distribution shown in Fig. 6.

Figure 6. Approximate Historical Percent Frequency of Occurrence for Values of ap

The daily and mean 162-day values for the solar 10.7-cm radio noise flux were assumed to be
equal, with a value selected randomly by use of the uniform distribution U[40,240]. The 15th day
of the month and Greenwich mean time (UTC) 1:00 were used for all runs - they only influence
GRAM-95 output moderately above 295,276 ft. Unfortunately, diurnal variations at middle and
lower altitudes could not be simulated.
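
The external generation of solar inputs described above might be sketched as follows. Only
the U[40,240] range and the equality of the daily and 162-day mean flux come from the text; the
ap bins and their percentages are hypothetical placeholders, since the Fig. 6 distribution is
not reproduced here, and the function and key names are illustrative.

```python
import random

# Hypothetical placeholder bins: (ap value, percent frequency). These numbers are
# illustrative only and are NOT the historical distribution shown in Fig. 6.
AP_BINS = [(4, 40.0), (7, 25.0), (15, 20.0), (27, 10.0), (48, 4.0), (80, 1.0)]

def sample_solar_inputs(rng):
    """Draw one set of solar inputs for a single Monte Carlo atmosphere run."""
    values = [ap for ap, _ in AP_BINS]
    weights = [pct for _, pct in AP_BINS]
    ap = rng.choices(values, weights=weights, k=1)[0]   # weighted draw of ap
    f107 = rng.uniform(40.0, 240.0)                     # U[40, 240], as in the text
    # Daily and 162-day mean flux are assumed equal, as stated in the text
    return {"ap": ap, "f107_daily": f107, "f107_mean": f107}
```

Calling `sample_solar_inputs(random.Random(seed))` once per run, with a recorded seed, would
keep each simulated atmosphere reproducible.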


One  October  run  failed  to  meet  the  70,000  lbs  of  lift  baseline  phase  criterion,  so  phasing  was 
implemented  at  65,000  lbs  for  that  run;  however,  all  other  runs  completed  normally.  Since  Monte 
Carlo  runs  were  not  subjected  to  the  restrictions  of  the  baseline  run,  but  merely  used  the  same 
payload, bank angles, and attack angles, landing location varied as shown in Figure 7.

Figure 7. Return Sites for Monte Carlo, Around-the-World, Skipping Trajectories


The peak altitudes of the trajectories averaged only 425,239 ft for January, but averaged 470,713 ft
for April, 484,768 ft for October, and 593,805 ft for July. Vehicle peak heat rate was higher for
the higher flights - averaging (in Btu/ft²/sec) 238.6 for January, 276.2 for April, 280.0 for October,
and 345.5 for July; however, total aerodynamic heat turned out to be lower in the summer. The
situation is illustrated in Figures 8, 9, and 10 for the case of an especially high July flight and an
especially low January flight - in which possible re-radiation of heat was not taken into account.

Figure 8. Altitude Profiles for a Low January Flight and a High July Flight

Figure 9. Heat Rate for a Low January Flight and a High July Flight

Figure 10. Total Heat for a Low January Flight and a High July Flight

Confidence intervals for peak altitude and peak heat rate were deduced from each set of 20 runs
and are given in Tables 2 and 3.


Table 2. Seasonal 90% Confidence Intervals for Peak RASV Altitude

                             January    April      July       October
Lower Altitude Limit (ft)    366,632    403,071    518,158    412,436
Upper Altitude Limit (ft)    489,024    533,342    665,747    546,614

Table 3. Seasonal 90% Confidence Intervals for Peak RASV Heat Rate

                                       January    April    July     October
Lower Heat Rate Limit (Btu/ft²/sec)    213.9      225.7    276.2    230.5
Upper Heat Rate Limit (Btu/ft²/sec)    279.3      320.9    398.4    330.5

Despite the variability in peak altitude, total energy (mass*gravity*height + mass*velocity²/2) was
relatively equal among trajectories, i.e., high trajectories had slightly lower velocities at apogee.
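
The energy statement above can be illustrated with a short sketch. It assumes a constant sea-level value of gravity (as the formula in the text implies) and uses a hypothetical apogee velocity of 25,000 ft/sec for the lower flight; only the two mean peak altitudes come from the text. The function names are illustrative, not taken from POST.

```python
import math

G_FT_S2 = 32.174  # sea-level gravity in ft/s^2, assumed constant with altitude


def specific_energy(height_ft: float, velocity_ft_s: float) -> float:
    """Total energy per unit mass, g*h + v^2/2, in ft^2/s^2."""
    return G_FT_S2 * height_ft + 0.5 * velocity_ft_s ** 2


def velocity_at_equal_energy(h_low_ft: float, v_low_ft_s: float, h_high_ft: float) -> float:
    """Apogee velocity of the higher flight if both flights share one specific energy."""
    v_squared = v_low_ft_s ** 2 - 2.0 * G_FT_S2 * (h_high_ft - h_low_ft)
    return math.sqrt(v_squared)


# Mean January and July peak altitudes from the text.
h_jan, h_jul = 425_239.0, 593_805.0
v_jan = 25_000.0  # hypothetical apogee velocity (ft/s), not a value from the study
v_jul = velocity_at_equal_energy(h_jan, v_jan, h_jul)
print(f"velocity deficit at the higher apogee: {v_jan - v_jul:.0f} ft/s")
```

Under these assumptions the higher apogee costs only a small fraction of the velocity, which is the sense in which high trajectories were "slightly" slower.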

Substantial  variations  were  also  noted  in  skipping  behavior.  Figures  11  and  12  illustrate  the  wide 
variation  in  amplitude  of  skips  late  in  flight  (without  a  change  in  attack  or  bank  angle).  Note  that 
Figure  11  does  not  display  one  of  the  80  Monte  Carlo  runs  but  a  January  run  without  winds  - 
including  winds  slightly  reduced  these  late  amplitudes. 

Some additional POST/GRAM-95 runs were made in order to estimate the influence of random
winds and of geomagnetic storms. Figure 13 shows the results for a July case in which a Monte
Carlo atmosphere was generated (i) with and without the presence of a very extreme geomagnetic
storm (ap = 400, daily F10.7 flux = 230, 162-day average flux = 230), and (ii) with and without
random winds. Evidently, winds significantly impact RASV performance but solar effects are
minimal; it should be recalled, however, that GRAM-95 fails to apply solar influences below 90
km (295,276 ft).


[Figure 13 plot: wind and geomagnetic storm effects on a July RASV skipping trajectory; polar,
once-around trajectory from/to Edwards AFB, CA]

Figure 13. Geomagnetic Storm and Wind Influences Upon a July Trajectory

The changes in trajectory characteristics within a given month were typically less substantial than
those between months. Figure 14 displays the variation among altitude profiles within the 20
January simulations. (Recall from Fig. 7 that January provided the largest variety of return sites for
the given set of baseline controls.)


[Figure 14 plot: RASV skipping profile; polar, once-around trajectory from/to Edwards AFB, CA,
for various January conditions]

Figure 14. Variation Among January Skipping Profiles

Summary

A sequence of 80 Monte Carlo, geographically varying atmospheres, together with flight
parameters obtained from a baseline trajectory for a conceptual Boeing RASV spaceplane, has
indicated that around-the-world, polar, skipping trajectories can be used to lift payloads of about
25,000 lbs into near-orbit conditions in a variety of atmospheric cases. The released payloads
would be capable of lifting satellites into low earth orbit using small (e.g., 5,000 ft/sec) upper stages.

Simulations employed a sled-launched, SSME-powered vehicle with two-position extendable
nozzles and supposed that the payload returned to earth with the spaceplane (a worst-case
condition).

The baseline flight parameters were optimized in order to lift the largest possible payload into the
prescribed near-orbit conditions. Some flight characteristics (peak altitude, skip amplitudes, and
heat rate) displayed substantial variability due to atmospheric and wind variations when flight
controls from the baseline case were applied.

Further work is required in order to determine the nature of the larger skipping amplitudes that
occurred late in flight within certain atmospheres and to find an ideal skipping motion with controls
for maintaining the motion.

THERMODYNAMIC STABILITY AND OXIDATION BEHAVIOR OF
REFRACTORY (Hf, Ta, Zr) CARBIDE/BORIDE COMPOSITES FOR
ULTRA HIGH TEMPERATURE APPLICATIONS


Jenn-Ming Yang
Professor 

Department  of  Materials  Science  and  Engineering 
University  of  California,  Los  Angeles 


ABSTRACT 

The multicomponent thermodynamic stability diagrams for the complex (Hf, Ta,
Zr, Si)-C(B)-O, Hf-Ta-C(B)-O and Zr-Si-C(B)-O systems were generated using the
proposed linear inequality method, in which the stability area of a compound is the solution of
a set of linear inequalities obtained directly from the free energy changes of
general chemical reactions. The thermodynamic relationships in the oxide scale and the
complicated interfacial reactions in the high temperature oxidation of these systems were
analyzed using the generated diagrams. The possible phases formed on the HfC-TaC,
HfB2-TaC, HfB2-TaB2, HfC-TaB2, ZrB2-SiC and ZrC-SiC composites at high temperatures
were predicted. The interactions between oxidation byproducts (CO, CO2, B2O3 gases),
carbides (or borides) and porous oxides were analyzed. The possible kinetic rate-controlling
processes were discussed based on outward and inward gas diffusion models and outward
and inward ionic diffusion models. Finally, attempts to improve the oxidation resistance of
refractory carbide/boride composites are discussed.

INTRODUCTION 

Advanced materials with temperature capability of over 2000 °C are needed for ultra-high
temperature structural applications, such as rocket engines and thermal protection
systems for space vehicles. The potential ceramic systems that can operate as
oxidation-resistant materials with temperature capabilities from 2000 to 2400 °C are refractory
oxides, carbides, borides and nitrides, as shown in Figure 1. Oxide-based ceramics are
often chemically inert but are more brittle than the non-oxide ceramics. Non-oxide
ceramics, on the other hand, are generally not very oxidation resistant when heated to such
high temperatures in an oxidizing environment; those that form continuous
protective oxide films are the exceptions. There are relatively few refractory oxides that are
stable in an oxidizing atmosphere above 2000 °C. Hafnia (melting point 2900 °C) and
zirconia (melting point 2770 °C) have sufficiently high melting temperatures and relatively
low vapor pressures. However, they undergo solid phase transformations, each with a
correspondingly large volume change: from monoclinic to tetragonal structure at 1150 °C
(zirconia) and 1650 °C (hafnia), and from tetragonal to cubic at 2370 °C (zirconia) and about
2700 °C (hafnia). The large volume change would result in the destruction of any large scale
component made from these materials, and so in practice the materials must be stabilized
with an appropriate additive such as calcium oxide, magnesium oxide, or yttrium oxide.

Refractory carbides and borides of Hf, Zr, Ta, etc. are potential candidates for ultra-high
temperature structural applications since they have melting temperatures considerably
higher than their associated oxides, do not undergo any solid phase transformation, and
have relatively good thermal shock resistance. However, very few studies [1-5] have
been conducted to investigate the oxidation behavior of refractory carbides and borides at
temperatures above 2000 °C. The oxidation of refractory carbides and borides
has been shown to combine inward diffusion of oxygen or outward diffusion of metal ions
with outward diffusion of gaseous (or, at relatively lower temperatures, liquid) byproducts
through the oxide scale [6-10]. Therefore, the oxidation resistance of carbides and
borides is mainly influenced by the formation and escape of gaseous byproducts (such
as CO, CO2 and B2O3) during oxidation, which makes their behavior significantly different
from that of their metal counterparts. Typically, the oxide scale formed at high temperature
consists of at least two distinct layers: (1) a much less porous inner oxide layer and (2)
a porous outer oxide layer [11,12]. However, Bargeron et al. [1] indicated that, in the oxidized
HfC film, an oxycarbide (HfO2-xCy) interlayer was found between the outer porous HfO2
layer and a residual carbide layer with dissolved oxygen in the lattice. This interlayer
was found to be a better diffusion barrier for oxygen than either the hafnium oxide or
carbide layers. The presence of the oxycarbide indicated that the oxygen partial pressure
falls to a relatively low value at the carbide/oxycarbide interface. Various
diffusion models have also been proposed to describe the oxidation behavior of refractory
carbides and borides. Holcomb et al. [13] proposed a counter-current gaseous diffusion
model to describe the oxidation behavior of HfC. Bargeron et al. [1] proposed an oxidation
model for HfC based on moving-boundary diffusion theory. Attempts have also been made
to improve the oxidation resistance of refractory carbides and borides with appropriate
additives. For example, the addition of SiC can improve the oxidation resistance of both
HfB2 and ZrB2 [5]. However, much more experimental work and theoretical study are needed
to better understand the oxidation behavior of the refractory
carbides/borides.

The major objective of this research is to construct multicomponent thermodynamic
stability diagrams using the linear inequality method, to predict the sequence of the
layers for gas-solid-solid interfacial reactions, and to interpret the oxidation behavior of
several refractory carbide/boride composites. The applications of these thermodynamic
stability diagrams are discussed. Furthermore, possible gaseous and ionic diffusion
models are discussed based on information obtained from the generated thermodynamic
stability diagrams.


THERMODYNAMIC  MODELS  FOR  THE  GENERATION  OF 
MULTICOMPONENT  THERMODYNAMIC  STABILITY  DIAGRAMS 

Since potential-pH diagrams were first proposed by Pourbaix [14], potential-pH
diagrams, two-partial-pressure diagrams and similar diagrams for metal-nonmetal-nonmetal
systems have been widely used in many fields of corrosion science and metallurgy. Methods of
calculating thermodynamic equilibria can be categorized into two approaches, namely the
stoichiometric and the non-stoichiometric ones [15]. The stoichiometric approach makes use of
independent reactions and their equilibrium constants, whereas the non-stoichiometric one
is based on linear equations in terms of the chemical potentials of the system components to
obtain the triple points of the stability areas of the three condensed species considered.
Yokokawa et al. [16-22] first proposed the use of generalized chemical potential diagrams
for metal-metal-nonmetal systems, in which a Log(aM1/aM2) coordinate was successfully
introduced into the diagrams generated by the non-stoichiometric method. The introduction
of the Log(aM1/aM2) coordinate makes the thermodynamic stability (or phase stability, or
chemical potential) diagrams much more powerful than thermodynamic stability diagrams
that deal only with pure metals. Such a thermodynamic stability diagram can also be
generated by the stoichiometric method, as outlined in this section.

For a system consisting of N different possible condensed compounds (solid or liquid),
the stability area of a certain compound i (1 ≤ i ≤ N) corresponds to the region in which the
free energy changes of the N-1 chemical (or electrochemical) reactions of compound i to
compounds 1, ..., i-1, i+1, ..., N, respectively, at a given condition are all greater than
or equal to zero. If one of the three parameters for the metal-nonmetal-nonmetal systems (i.e.,
T, pH and electrode potential in the potential-pH diagrams; T and two partial pressures in
the partial pressure diagrams and partial pressure-1/T diagrams) is constant, the
thermodynamic stability diagrams are two-dimensional. If two of the four parameters for
metal-metal-nonmetal-nonmetal systems or one of the three parameters for
metal-metal-nonmetal systems are constant, the multicomponent thermodynamic stability diagrams are
also two-dimensional. The geometrical shape of the stability area of a compound is a convex
polygon. The stability area of a compound can be considered to be the
solution of a set of linear inequalities obtained directly from the free energy
changes of chemical (or electrochemical) reactions.

The multicomponent thermodynamic stability diagrams of metal-metal-nonmetal
systems are selected as an example to describe the construction of a set of linear
inequalities. Two additional examples illustrating how to obtain the Log(aM1/aM2) and
Log(aM3/aM2) coordinates in two or three metallic component systems are given in the
Appendix.

An A-B-O ternary system is considered here (A and B are metallic elements and O is
oxygen; these are the system components). If there are N possible condensed species, then
the reactions between condensed species, the two metallic components and oxygen can be
studied in terms of chemical reactions that proceed according to the general equation:

$$ m_{ij}\,A_{x_i}B_{y_i}O_{z_i} - n_{ij}\,A + n_{ij}\,B + l_{ij}\,O_2 = A_{x_j}B_{y_j}O_{z_j} \tag{1} $$

where $A_{x_i}B_{y_i}O_{z_i}$ and $A_{x_j}B_{y_j}O_{z_j}$ are the general forms of condensed
species in the A-B-O ternary system, and $x_i, y_i, z_i$ and $x_j, y_j, z_j$ are the
stoichiometric numbers of these two compounds, respectively.

The elemental balances for each of the three elements, i.e. A, B and O
($m_{ij}x_i - n_{ij} = x_j$, $m_{ij}y_i + n_{ij} = y_j$ and $m_{ij}z_i + 2l_{ij} = z_j$), are then
used to find $n_{ij}$, $m_{ij}$ and $l_{ij}$ for the above reaction, which yields:

$$ m_{ij} = \frac{x_j + y_j}{x_i + y_i} \tag{2} $$

$$ n_{ij} = \frac{y_j x_i - x_j y_i}{x_i + y_i} \tag{3} $$

$$ l_{ij} = \frac{x_i z_j + y_i z_j - x_j z_i - y_j z_i}{2\,(x_i + y_i)} \tag{4} $$

where $i = 1, \ldots, N$ and $j = 1, \ldots, i-1, i+1, \ldots, N$.
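
As a check on equations (2)-(4), the coefficients can be computed and verified against the elemental balances with exact fractions. The sketch below is illustrative only: the names `Compound`, `reaction_coefficients` and `is_balanced` are hypothetical, and the choice of A = Hf and B = Ta with the pair HfO2/Ta2O5 is simply a convenient example.

```python
from fractions import Fraction
from typing import NamedTuple


class Compound(NamedTuple):
    """Condensed species A_x B_y O_z in an A-B-O system."""
    x: int  # atoms of metal A
    y: int  # atoms of metal B
    z: int  # atoms of oxygen


def reaction_coefficients(ci: Compound, cj: Compound):
    """Return (m_ij, n_ij, l_ij) of reaction (1), using equations (2)-(4)."""
    denom = ci.x + ci.y
    m = Fraction(cj.x + cj.y, denom)
    n = Fraction(cj.y * ci.x - cj.x * ci.y, denom)
    l = Fraction(ci.x * cj.z + ci.y * cj.z - cj.x * ci.z - cj.y * ci.z, 2 * denom)
    return m, n, l


def is_balanced(ci: Compound, cj: Compound, m, n, l) -> bool:
    """Check the A, B and O balances of m*ci - n*A + n*B + l*O2 = cj."""
    a_ok = m * ci.x - n == cj.x
    b_ok = m * ci.y + n == cj.y
    o_ok = m * ci.z + 2 * l == cj.z
    return a_ok and b_ok and o_ok


# Illustrative pair with A = Hf, B = Ta: compound i = HfO2, compound j = Ta2O5.
hfo2 = Compound(x=1, y=0, z=2)
ta2o5 = Compound(x=0, y=2, z=5)
m, n, l = reaction_coefficients(hfo2, ta2o5)
print(m, n, l)  # coefficients of 2 HfO2 - 2 Hf + 2 Ta + 1/2 O2 = Ta2O5
```

Exact fractions are used so that the balance check is not affected by rounding.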

According to the thermodynamic properties of equilibria and from the general reaction
(1), the stability area of substance i ($A_{x_i}B_{y_i}O_{z_i}$, where i is a fixed value with
$1 \le i \le N$), obtained by comparison with the other substances j ($A_{x_j}B_{y_j}O_{z_j}$,
where $j = 1, \ldots, i-1, i+1, \ldots, N$), must satisfy the following conditions:

$$ \Delta G_{ij} = \Delta G_{ij}^{\circ} + RT \ln K_{p,ij} \ge 0 $$

where

$$ \Delta G_{ij}^{\circ} = G^{\circ}_{A_{x_j}B_{y_j}O_{z_j}} - l_{ij}\,G^{\circ}_{O_2} + n_{ij}\,G^{\circ}_{A} - n_{ij}\,G^{\circ}_{B} - m_{ij}\,G^{\circ}_{A_{x_i}B_{y_i}O_{z_i}} $$

and

$$ K_{p,ij} = \frac{a_{A_{x_j}B_{y_j}O_{z_j}}\; a_A^{\,n_{ij}}}{a_{A_{x_i}B_{y_i}O_{z_i}}^{\,m_{ij}}\; a_B^{\,n_{ij}}\; P_{O_2}^{\,l_{ij}}} $$

$K_{p,ij}$ is the thermodynamic equilibrium constant for reaction (1), $a_A$ and $a_B$ are the
activities of the two metallic components, $a_{A_{x_i}B_{y_i}O_{z_i}}$ and
$a_{A_{x_j}B_{y_j}O_{z_j}}$ are the activities of the two condensed species whose stability areas
are considered, and $P_{O_2}$ is the oxygen partial pressure.

Suppose that all the activities of condensed species are equal to 1 (i.e.
$a_{A_{x_i}B_{y_i}O_{z_i}} = a_{A_{x_j}B_{y_j}O_{z_j}} = 1$), and let

$$ k_{ij} = \frac{\Delta G_{ij}^{\circ}}{2.303\,RT}, \qquad X_1 = -\log P_{O_2}, \qquad X_2 = +\log\,(a_A / a_B); $$

then N-1 linear inequalities can be written as:

$$ k_{ij} + l_{ij} X_1 + n_{ij} X_2 \ge 0 \tag{5} $$

where i is a fixed value ($1 \le i \le N$) and $j = 1, \ldots, i-1, i+1, \ldots, N$.

Let:

$$ C_{ij} = -k_{ij}, \quad A_{ij} = l_{ij}, \quad B_{ij} = n_{ij} \qquad \text{when } i > j $$

$$ C_{i,j-1} = -k_{ij}, \quad A_{i,j-1} = l_{ij}, \quad B_{i,j-1} = n_{ij} \qquad \text{when } i < j $$

then the N-1 linear inequalities (5) can be rewritten as:

$$ A_{ij} X_1 + B_{ij} X_2 \ge C_{ij} \tag{6} $$

where $j = 1, \ldots, N-1$.

The set of linear inequalities (6) defines the stability area of condensed compound
i ($A_{x_i}B_{y_i}O_{z_i}$). When i changes from 1 to N, N sets of linear inequalities of the form
(6) are obtained in the same way.
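
A set of inequalities of the form (6) can be turned into the corner points of a convex stability polygon in a few lines. The sketch below uses brute-force pairwise intersection of the boundary lines rather than the linear-programming approach described in the text, and it assumes the diagram window itself is supplied as additional inequalities so that the region is bounded. The function names and the numerical constraints are made up for illustration.

```python
from itertools import combinations

# Each constraint is a tuple (A, B, C) meaning A*X1 + B*X2 >= C, as in inequality (6).


def satisfies(point, constraints, tol=1e-9):
    """True if the point (X1, X2) satisfies every inequality."""
    x1, x2 = point
    return all(a * x1 + b * x2 >= c - tol for a, b, c in constraints)


def polygon_vertices(constraints, tol=1e-9):
    """Corner points of the feasible region: pairwise boundary-line
    intersections that also satisfy all the other inequalities."""
    vertices = []
    for (a1, b1, c1), (a2, b2, c2) in combinations(constraints, 2):
        det = a1 * b2 - a2 * b1
        if abs(det) < tol:  # parallel boundary lines do not intersect
            continue
        x1 = (c1 * b2 - c2 * b1) / det  # Cramer's rule
        x2 = (a1 * c2 - a2 * c1) / det
        if satisfies((x1, x2), constraints, tol):
            vertices.append((x1, x2))
    return vertices


# Made-up example: window 0 <= X1 <= 2, 0 <= X2 <= 1, plus one sloped boundary X1 + X2 <= 2.5.
example = [
    (1.0, 0.0, 0.0),     # X1 >= 0
    (-1.0, 0.0, -2.0),   # X1 <= 2
    (0.0, 1.0, 0.0),     # X2 >= 0
    (0.0, -1.0, -1.0),   # X2 <= 1
    (-1.0, -1.0, -2.5),  # X1 + X2 <= 2.5
]
corners = polygon_vertices(example)
print(sorted(corners))
```

Pairwise enumeration scales quadratically with the number of compounds, which is acceptable for a sketch but is one reason a linear-programming formulation is attractive for larger systems.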

When there are many compounds to be considered in the system, it is necessary to use a
computer program to generate the thermodynamic stability diagrams. A computer
algorithm for the generation of phase stability diagrams was proposed by Wang et
al. [23], in which a series of relevant objective functions with minimum numbers was
introduced to the set of linear inequalities to form linear programming problems.
A FORTRAN program was constructed to produce the phase stability diagrams using
the Revised Simplex Method to solve the linear programming problems.
Various researchers [24-29] have also developed computer algorithms and
produced phase stability diagrams, but these deal only with pure metals. The
thermodynamic stability diagrams generated by the above linear inequality method are equivalent to
those generated by the chemical potential method of Yokokawa et al. The slopes of boundary
lines can be obtained directly from the specific general reactions (for example, in equation
(1), the slope of the boundary line between the stability areas of two possible condensed species is
n_ij/l_ij). The convex polygon stability area of a given condensed substance is obtained
directly as the solution of a set of linear inequalities. However, unlike the chemical potential
method proposed by Yokokawa et al. [16,17], where the fundamental stoichiometric number
matrix represents linear equations, the fundamental linear inequalities with at least one
coordinate of Log(aM1/aM2) in the linear inequality method are obtained directly from the
free energy changes of the specific general chemical (or electrochemical) reaction
equation as shown in (1). Such specific general chemical (or electrochemical)
reaction equations can be obtained in any system that includes at least two metallic
components and one non-metallic component.


THERMODYNAMIC  DATA 

Thermodynamic and physical property data were collected from different sources.
Thermochemical Properties of Inorganic Substances [30] and the JANAF
Thermodynamic Tables [31] were used as the primary sources for free-energy values, heats
of formation, heat capacities (Cp), and entropy values. This information is listed in
Table I.

RESULTS  AND  DISCUSSION 
1.  Partial  Pressure  Diagrams 

The generated LogPO2-LogPCO, LogPO2-LogPCO2, LogPCO2-LogPCO and
LogPO2-LogPB2O3 diagrams for the Hf-C(or B)-O, Zr-C(or B)-O, Ta-C(or B)-O and Si-C-O
systems at 2300 K are shown in Figs. 2-5, respectively. From the analysis of these
thermodynamic stability diagrams, it is known that, in the temperature range of 2000-2700 K,
hafnium, zirconium, and tantalum oxides can be formed over a wide range of O2,
CO2, CO and B2O3 partial pressures. Hafnium, zirconium and tantalum borides
are not stable in an oxidizing atmosphere and will directly form oxides. However, hafnium,
zirconium and tantalum carbides could be stable in a reducing atmosphere of LogPO2 =
-10 to -16 and LogPCO = -7 to -1 (atm). Thus, hafnium, zirconium and tantalum
carbides could be high temperature oxidation-resistant materials in atmospheres with very low
oxygen activity and high carbon activity.

Thermodynamic analysis also provides insight into the mechanism of formation
of the outer porous oxide layer during oxidation of refractory carbides and borides, as reported
by Bargeron et al., Courtright et al. and others. For example, the thermodynamic stability
boundary line of HfC/HfO2 is obtained from the following two reactions:

HfC + 3/2 O2 = HfO2 + CO     (7)

HfC + 2 O2 = HfO2 + CO2      (8)

There exists another equilibrium reaction in the gas environment:

CO + 1/2 O2 = CO2            (9)

Table 2 shows the equilibrium CO and CO2 partial pressures for different oxygen partial
pressures at the HfC/HfO2 interface at 2300 K. From the analysis of Table 2, it is known
that when the oxygen partial pressure is higher than 10^-12 atm, the CO equilibrium
partial pressure can be extremely high. As a result, if the oxygen partial pressure at the
carbide/oxide interface remains relatively high, the oxide layer formed could be
destroyed by the entrapped gas. In this case, the formation of a porous oxide layer is
necessary to allow the gas to escape and to maintain the integrity of the oxide scale.

Table 2. The equilibrium partial pressures at the HfC/HfO2 interface at 2300 K

LogPO2     -1         -10        -12        -14        -16        -18
LogPCO     +16.927    +3.427     +0.427     -2.573     -5.573     -8.573
LogPCO2    +18.354    +0.354     -3.646     -7.646     -11.646    -15.646
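
The rows of Table 2 follow directly from the mass-action expressions for reactions (7) and (8) with the condensed phases at unit activity: LogPCO = LogK7 + 3/2 LogPO2 and LogPCO2 = LogK8 + 2 LogPO2. The sketch below reproduces the table under that assumption; the two log K values are back-calculated from the table entries themselves, not taken from the thermodynamic data sources, and the function names are illustrative.

```python
# log10 equilibrium constants at 2300 K, back-calculated from Table 2
# (assumption: HfC and HfO2 at unit activity); not independent thermodynamic data.
LOG_K7 = 18.427  # HfC + 3/2 O2 = HfO2 + CO
LOG_K8 = 20.354  # HfC + 2 O2   = HfO2 + CO2


def log_p_co(log_p_o2: float) -> float:
    """Equilibrium log10 P_CO (atm) at the HfC/HfO2 interface, from reaction (7)."""
    return LOG_K7 + 1.5 * log_p_o2


def log_p_co2(log_p_o2: float) -> float:
    """Equilibrium log10 P_CO2 (atm) at the HfC/HfO2 interface, from reaction (8)."""
    return LOG_K8 + 2.0 * log_p_o2


for lp in (-1, -10, -12, -14, -16, -18):
    print(f"LogPO2 = {lp:4d}   LogPCO = {log_p_co(lp):+8.3f}   LogPCO2 = {log_p_co2(lp):+8.3f}")
```

The difference LogPCO2 - LogPCO - 1/2 LogPO2 is then constant, as required by the gas-phase equilibrium (9).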


2. The Log(aA/aB) vs LogPO2, Log(aA/aB) vs LogPCO (or LogPCO2) and
Log(aA/aB) vs LogPB2O3 Diagrams

The generated diagrams of Log(aTa/aHf) vs LogPO2 (LogPCO, LogPCO2 and LogPB2O3)
for the Hf-Ta-C(B)-O systems and Log(aSi/aZr) vs LogPO2 (LogPCO, LogPCO2 and LogPB2O3)
for the Zr-Si-C(B)-O systems are shown in Figs. 6-15, respectively. From these diagrams, it
is clear that when the oxygen partial pressure remains relatively high, the stability
areas of oxides in the oxide scale depend only on the activities of the metallic components. It
is well known that the activity gradient of a metallic component across the scale depends
mainly on the diffusion coefficient of the metallic ion through the oxide layer. The metallic
component with the highest diffusion coefficient usually forms the outermost
oxide layer, and the metallic component with the lowest diffusion coefficient usually
forms the inner oxide layer. Taking this diffusion effect into account, the
thermodynamic stability diagrams can be used directly to predict the possible
phases formed during the oxidation of HfB2-TaC, HfC-TaC, HfB2-TaB2, HfC-TaB2, ZrB2-SiC,
and ZrC-SiC composites. In the following, a detailed analysis of the oxidation processes
of several refractory carbide/boride composites is presented.

2.1 The interfacial reaction and layer sequence in the high temperature oxidation of HfC-TaC
composite

The high temperature oxidation of HfC-TaC composite in the temperature range of
1400-2200 °C was studied by Courtright et al. [2] and Patterson et al. [4]. It was found
that the oxide layers that formed on HfC-TaC obeyed parabolic growth kinetics.
However, there was a break in the kinetics around 1800 °C. The oxidation kinetics of
HfC-TaC above 1800 °C were in fair agreement with those reported for the Hf-Ta alloys.
Below 1800 °C, however, a notable difference was observed between the oxide layers that
formed on the carbide and on the metallic alloy system. The Hf-Ta alloy formed a
dense, tenacious, crack-resistant oxide, whereas the carbide developed an oxide that was
quite porous and prone to cracking. The oxide layer that formed on the carbide was a
mixture of HfO2 and Ta6Hf2O19 (> 1900 °C). Around 1800 °C, the outermost oxide layer
was found to be Ta2O5. This indicated that the diffusion coefficient of the Ta ion through the
oxide is greater than that of the Hf ion under this condition.

From the generated thermodynamic stability diagram for the HfC-TaC composite
shown in Fig. 10, the sequence of the porous oxide layers would be Ta6Hf2O19 (or Ta2O5)
(outermost)/HfO2 (inner porous oxide layer). In the outer porous oxide layer, because
the oxygen partial pressure remains relatively high, the oxide layer sequence depends mainly
on the diffusion coefficients of the metallic ions through the oxide layer. The
possible oxides (Ta2O5, Ta6Hf2O19 and HfO2) are all thermodynamically stable in different
Log(aTa/aHf) ranges. The influence of oxygen partial pressure (in the range LogPO2 =
-9 to 0) on the stability areas of the various possible oxides is almost negligible. The stability
areas of the various oxides depend mainly on the activities of the metallic components.
Thus, Ta2O5 or Ta6Hf2O19 is formed as the outermost layer because the activity of the Ta ion
is much greater than that of the Hf ion. HfO2 is formed as the inner porous oxide layer
because HfO2 is stable even in the lower Hf activity range (Log(aHf/aTa) > -4) and at
relatively low oxygen partial pressure. The layer sequence in the less porous inner oxide
layer would be HfO2/HfCxOy + Ta(TaC). In this layer, the stabilities of oxides and carbides
and the interfacial reactions are strongly influenced by the oxygen partial pressure and the
activities of the metallic components. The results obtained from the thermodynamic analysis are in
good agreement with the experimental results.

2.2 The interfacial reaction and layer sequence in the high temperature oxidation of
ZrB2(or ZrC)/SiC

The effect of an SiC addition on the oxidation of ZrB2 up to 1500 °C was studied by
Tripp et al. [32]. The experimental results showed that in the temperature range of
1300 - 1500 °C, SiC additions to ZrB2 improve its oxidation resistance. An SiO2-rich glassy
layer was found to be the outermost oxide layer, whereas a ZrO2-rich oxide layer was the
inner oxide layer. SiC could coexist with ZrO2 in the inner oxide layer.

The generated thermodynamic stability diagrams for SiC/ZrB2 at 1700 K at different
oxygen partial pressures are shown in Figs. 16-17. From the thermodynamic
stability diagrams, it is known that the complex oxide SiZrO4 has a stability area and
might be formed during the oxidation process. However, the complex oxide SiZrO4 is not
stable when T > 1680 °C. At higher temperatures, the oxides that can be formed on the
surface are ZrO2 and SiO2 (s,l). Thus, the sequence of the multilayer oxide scale at higher
temperatures will be different from that at lower temperatures. The layer sequence in the
temperature range of 1300 - 1500 °C would be SiO2 (outermost)/SiZrO4 + SiO2/SiZrO4
+ ZrO2/ZrO2 + SiC/ZrB2 + SiC. The sequence of the multilayer oxide scale in the
temperature range of 1700 - 2200 °C would be SiO2(l) (outermost)/SiO2(l) + ZrO2/ZrO2
+ SiC/ZrB2 + SiC. The oxidation behavior of ZrB2-SiC composite in the temperature
range of 1800-2400 °C was studied by Bull et al. [5]. The experimental results
confirmed that silicon oxide was formed as the outermost layer, while zirconium oxide
was formed as the inner oxide layer. Silicon carbide coexisted with zirconium oxide in the
internal oxidation region.

It is well known that the formation of a liquid glass (such as B2O3(l) in the temperature
range 500 - 1000 °C and SiO2(l) when T > 1683 °C) can separate two totally different gas
environments (one with high oxygen partial pressure and another with very low
oxygen partial pressure). The formation of liquid SiO2 (T > 1700 °C) might serve as a
diffusion barrier. However, at higher temperatures, the volatilization of SiO(g) has a
significant influence on the stability of liquid SiO2. The effect of the SiC additive on the
oxidation resistance of ZrB2 is greatly limited by the formation of SiO(g). The
thermodynamic boundary line of SiC/SiO2 is obtained from the following two reactions:

SiC + 3/2 O2 = SiO2 + CO     (10)

SiC + 2 O2 = SiO2 + CO2      (11)

There exists a volatilization equilibrium between SiO(g) and SiO2:

SiO2(l) = SiO(g) + 1/2 O2    (12)

Table 3 shows the equilibrium partial pressures at the SiC/SiO2(l) and SiO(g)/SiO2(l)
interfaces at 2000, 2300 and 2500 K. From the analysis of Table 3,
the equilibrium SiO(g) partial pressure decreases with increasing oxygen partial
pressure and increases with increasing temperature. Thus, at higher temperatures (T >
2500 K), the volatilization of SiO(g) becomes the dominant process through reactions (10)
to (12). In this case, SiO2(l) formed on the surface will volatilize rapidly and the
glassy liquid film cannot serve as an effective diffusion barrier. However, at lower
temperatures (T < 2300 K), the glassy SiO2(l) film is stable and can be used as a diffusion
barrier. The existence of such a diffusion barrier would greatly improve the oxidation
resistance of the carbide or boride.
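
The two trends for SiO(g) follow from the mass-action expression for reaction (12) with SiO2(l) at unit activity: LogPSiO = LogK12 - 1/2 LogPO2. A minimal sketch is given below; the log K12 values are back-calculated from the SiO entries of Table 3 rather than taken from the thermodynamic data sources, and the function name is illustrative.

```python
# log10 K for SiO2(l) = SiO(g) + 1/2 O2, back-calculated from the SiO row of Table 3
# (assumption: SiO2(l) at unit activity); keys are temperatures in K.
LOG_K12 = {2000: -7.695, 2300: -5.011, 2500: -3.906}


def log_p_sio(temperature_k: int, log_p_o2: float) -> float:
    """Equilibrium log10 P_SiO (atm) over SiO2(l) at the given oxygen partial pressure."""
    return LOG_K12[temperature_k] - 0.5 * log_p_o2


for t in (2000, 2300, 2500):
    row = "  ".join(f"{log_p_sio(t, lp):+7.3f}" for lp in (-1, -10, -14))
    print(f"T = {t} K   LogPSiO at LogPO2 = -1, -10, -14:  {row}")
```

The negative slope in LogPO2 gives the decrease of SiO(g) pressure with increasing oxygen pressure, and the rise of log K12 with temperature gives the increase with temperature.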


Table 3. The equilibrium partial pressures of SiO(g), CO and CO2 with different PO2

           T = 2000 K                     T = 2300 K                     T = 2500 K
LogPO2     -1        -10       -14        -1        -10       -14        -1        -10       -14
LogPCO     +19.099   +5.599    -0.401     +15.932   +2.432    -3.568     +14.248   +0.748    -5.252
LogPCO2    +21.471   +3.471    -4.529     +17.359   -0.641    -8.641     +15.174   -2.826    -10.826
LogPSiO    -7.195    -2.695    -0.695     -4.511    -0.011    +1.989     -3.406    +1.094    +3.094

CONCLUSION


High temperature oxidation of refractory carbides or borides is complicated by the
formation of gaseous oxidation byproducts. In order to understand the oxidation
mechanism, both outward diffusion (or escape) of metallic ions and of the gases formed, and
inward diffusion of oxygen, should be considered based on the thermodynamics and kinetics of
the various possible interfacial reactions. The thermodynamic stability diagrams that include
at least two metallic components can clearly represent the stability area of a complex oxide
in a certain Log(aM1/aM2) range. The possible oxides that are stable and can be used as
oxidation-resistant materials in the temperature range of 2000 - 2500 °C are HfO2 and
ZrO2. However, the high oxygen anion mobilities and the porous, defective structures of
these two oxides formed during the oxidation of their carbides or borides
indicate that hafnium and zirconium carbides or borides alone cannot be used as high
temperature oxidation-resistant materials for extended periods of time. In order to improve
the oxidation resistance of hafnium and zirconium carbides or borides, attempts should be
directed toward incorporating appropriate additives such as SiC, TiC, etc. Al2O3 may
be another possible additive. The incorporation of these additives into the base hafnium and
zirconium carbides or borides may lead to the formation of a solid (most likely complex
oxide) or viscous liquid diffusion barrier. A liquid SiO2 film can be used as the diffusion
barrier when T < 2300 K. Liquid Al2O3 and TiOx films may be used as the diffusion
barrier when T < 2700 K. Furthermore, liquid Al2O3 and TiOx may react with HfO2 or
ZrO2 to form various solid complex oxides in the temperature range of 2200 -
2500 K. These complex oxide layers may also serve as diffusion barriers to further
improve the oxidation resistance. Experimental verification is in progress to elucidate the
role of additives in the oxidation behavior of refractory carbide/boride composites.


APPENDIX 


The approach used to obtain the Log(a_M1/a_M2) and Log(a_M3/a_M2) coordinates in
systems with two or three metallic components is illustrated here.

For an A-B-O-S system (A and B are metallic elements; O and S are non-metallic
elements; these are the system components), the general reaction equation can be
written as follows:

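\[
m_{ij}\,\mathrm{A}_{w_i}\mathrm{B}_{x_i}\mathrm{O}_{y_i}\mathrm{S}_{z_i}
 - n_{ij}\,\mathrm{A} + n_{ij}\,\mathrm{B} + l_{ij}\,\mathrm{O} + k_{ij}\,\mathrm{S}
 = \mathrm{A}_{w_j}\mathrm{B}_{x_j}\mathrm{O}_{y_j}\mathrm{S}_{z_j}
\tag{13}
\]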

where  mij,  nij,  lij  and  kij  are  directly  defined  by  stoichiometric  numbers  in  the  two 
condensed  substances  i  and  j.  The  following  four  equations  are  obtained  based  on  the 
consideration  of  elemental  balances: 


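\[ m_{ij} = \frac{w_j + x_j}{w_i + x_i} \tag{14} \]

\[ n_{ij} = \frac{w_i x_j - w_j x_i}{w_i + x_i} \tag{15} \]

\[ l_{ij} = \frac{y_j w_i + y_j x_i - y_i w_j - y_i x_j}{w_i + x_i} \tag{16} \]

\[ k_{ij} = \frac{z_j w_i + z_j x_i - z_i w_j - z_i x_j}{w_i + x_i} \tag{17} \]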

The Log(a_A/a_B) coordinate is then obtained directly from the equilibrium constant Kpij
for reaction (13).
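
The elemental balances behind the coefficients of the A-B-O-S reaction can be checked
numerically. The following is a minimal sketch, not the authors' program: the function
name abos_coefficients and the example stoichiometries (an AO phase converting to an
AB2O4 phase) are hypothetical and chosen only for illustration. Exact fractions are
used so that the balances can be tested for equality.

```python
from fractions import Fraction


def abos_coefficients(phase_i, phase_j):
    """Return (m, n, l, k) for  m*i - n*A + n*B + l*O + k*S = j.

    Each phase is a (w, x, y, z) tuple of integers: the numbers of
    A, B, O and S atoms per formula unit.
    """
    wi, xi, yi, zi = phase_i
    wj, xj, yj, zj = phase_j
    m = Fraction(wj + xj, wi + xi)  # total metal (A + B) balance
    n = m * wi - wj                 # A balance
    l = yj - m * yi                 # O balance
    k = zj - m * zi                 # S balance
    # The B balance must then hold automatically.
    assert m * xi + n == xj
    return m, n, l, k


# Hypothetical example: phase i = AO, phase j = AB2O4 (no sulfur).
m, n, l, k = abos_coefficients((1, 0, 1, 0), (1, 2, 4, 0))
print(m, n, l, k)
```

For this example the reaction reads 3 AO - 2 A + 2 B + O = AB2O4, which balances in
A, B and O.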

For an A-B-C-O system (A, B and C are metallic elements; O is a non-metallic element;
these are the system components), the general reaction can be written as follows:
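\[
m_{ij}\,\mathrm{A}_{w_i}\mathrm{B}_{x_i}\mathrm{C}_{y_i}\mathrm{O}_{z_i}
 - n_{ij}\,\mathrm{A} - l_{ij}\,\mathrm{B} + (n_{ij} + l_{ij})\,\mathrm{C} + k_{ij}\,\mathrm{O}
 = \mathrm{A}_{w_j}\mathrm{B}_{x_j}\mathrm{C}_{y_j}\mathrm{O}_{z_j}
\tag{18}
\]

where mij, nij, lij and kij are given by the following equations: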

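\[ m_{ij} = \frac{w_j + x_j + y_j}{w_i + x_i + y_i} \tag{19} \]

\[ n_{ij} = \frac{w_i x_j + w_i y_j - w_j x_i - w_j y_i}{w_i + x_i + y_i} \tag{20} \]

\[ l_{ij} = \frac{w_j x_i + y_j x_i - w_i x_j - y_i x_j}{w_i + x_i + y_i} \tag{21} \]

\[ k_{ij} = \frac{z_j w_i + z_j x_i + z_j y_i - z_i w_j - z_i x_j - z_i y_j}{w_i + x_i + y_i} \tag{22} \]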

The Log(a_A/a_C) and Log(a_B/a_C) coordinates are then obtained directly from the
equilibrium constant Kpij for reaction (18).
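
The same kind of check can be made for the three-metal case. The sketch below is
illustrative only: the function name abco_coefficients and the two example phases
(AC2O4 and ABO4) are hypothetical. It solves the A, B and O balances of the A-B-C-O
reaction and confirms that the C balance, with coefficient (n + l), is then satisfied
automatically.

```python
from fractions import Fraction


def abco_coefficients(phase_i, phase_j):
    """Return (m, n, l, k) for  m*i - n*A - l*B + (n + l)*C + k*O = j.

    Each phase is a (w, x, y, z) tuple of integers: the numbers of
    A, B, C and O atoms per formula unit.
    """
    wi, xi, yi, zi = phase_i
    wj, xj, yj, zj = phase_j
    m = Fraction(wj + xj + yj, wi + xi + yi)  # total metal (A + B + C) balance
    n = m * wi - wj                           # A balance
    l = m * xi - xj                           # B balance
    k = zj - m * zi                           # O balance
    # The C balance, with coefficient (n + l), must hold automatically.
    assert m * yi + n + l == yj
    return m, n, l, k


# Hypothetical example: phase i = AC2O4, phase j = ABO4.
m, n, l, k = abco_coefficients((1, 0, 2, 4), (1, 1, 0, 4))
print(m, n, l, k)
```

Negative values of n or l simply mean that the corresponding metal appears on the
reactant side with a positive coefficient.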